AMERICAN 
JOURNAL OF MATHEMATICS


FOUNDED BY THE JOHNS HOPKINS UNIVERSITY 


EDITED BY 


E. T. BELL ABRAHAM COHEN
CALIFORNIA INSTITUTE OF TECHNOLOGY THE JOHNS HOPKINS UNIVERSITY 


E. W. CHITTENDEN F. D. MURNAGHAN 
UNIVERSITY OF IOWA THE JOHNS HOPKINS UNIVERSITY 


J. F. RITT 
COLUMBIA UNIVERSITY 


WITH THE COOPERATION OF 


FRANK MORLEY MARSTON MORSE 


W. A. MANNING E. P. LANE 
HARRY BATEMAN ALONZO CHURCH 


HARRY LEVY L. R. FORD 
J. R. KLINE OSCAR ZARISKI 


PUBLISHED UNDER THE JOINT AUSPICES OF 


THE JOHNS HOPKINS UNIVERSITY 
AND 


THE AMERICAN MATHEMATICAL SOCIETY 


Volume LIX, Number 2 
APRIL, 1937 


THE JOHNS HOPKINS PRESS 
BALTIMORE, MARYLAND 
U. S. A.


G. C. EVANS
AUREL WINTNER
GABRIEL SZEGÖ


CONTENTS 


Astronomical consequences of the relativistic two-body problem. By
Tullio Levi-Civita,

Finite deformations of an elastic solid. By F. D. Murnaghan,

Mean motions and distribution functions. By Philip Hartman, E. R.
van Kampen and Aurel Wintner,

On an absolute constant in the theory of variational stability. By E. R.
van Kampen and Aurel Wintner,

On the expansion of the remainder in the open-type Newton-Cotes
quadrature formula. By Orville G. Harrold, Jr.,

On certain fundamental identities due to Uspensky. By W. A. Dwyer,

An extension of Bernstein’s theorem associated with general boundary
value problems. By W. H. McEwen,

Abstract covariant vector fields in a general absolute calculus. By A. D.
Michal,

A type of homogeneity for continuous curves. By Charles H.

A navigation problem in the calculus of variations. By E. J. McShane,

The topological discriminant group of a Riemann surface of genus p.
By Oscar

On those points of an algebraic manifold not reachable by a given
parametric representation. By J. F. Daly,

A remark concerning the parametric representation of an algebraic
variety. By Oscar Zariski,

On the construction of symmetric ruled surfaces. By Arnold Emch,

On circles connected with three and four lines. By J. R. Musselman,

On the densities of infinite convolutions. By Aurel Wintner,

A representation of Stieltjes integrals by conditionally convergent series.
By Fritz John,

Note on the definition of fields by independent postulates in terms of the
inverse operations. By David G. Rabinow,

The representation of integers as sums of values of cubic polynomials. II.
By R. D.

The conjunctive equivalence of pencils of hermitian and anti-hermitian
matrices. By John Williamson,

Some remarks on class field theory over infinite fields of algebraic numbers.
By O. F. G. 414

On the addition of convex curves. II. By Richard Kershner, 423

Real canonical binary trilinear forms. By Rufus Oldenburger,

A remark on a theorem of Arzela. By Philip Hartman, 436


THE AMERICAN JOURNAL OF MATHEMATICS appears four times yearly. 

The subscription price of the JOURNAL for the current volume is $7.50 (foreign
postage 50 cents); single numbers $2.00.

A few complete sets of the JOURNAL remain on sale.

Papers intended for publication in the JOURNAL may be sent to any of the Editors.

Editorial communications may be sent to Dr. A. COHEN at The Johns Hopkins
University.

Subscriptions for the JOURNAL and all business communications should be sent to
THE JOHNS HOPKINS PRESS, BALTIMORE, MARYLAND, U. S. A.


Entered as second-class matter at the Baltimore, Maryland, Postoffice. Acceptance for mailing at special
rate of postage provided for in Section 1103, Act of October 3, 1917, Authorized on July 3, 1918.


PRINTED IN THE UNITED STATES OF AMERICA 
BY J. H. FURST COMPANY, BALTIMORE, MARYLAND 




ASTRONOMICAL CONSEQUENCES OF THE RELATIVISTIC 
TWO-BODY PROBLEM.* 


By TULLIO LEVI-CIVITA.


1. Mechanical laws, according to Einstein’s theory, are much more complicated
in conception than under the assumptions of Newton. However, the motions of
celestial bodies under ordinary circumstances differ so little from their
Newtonian representation that, for astronomical purposes, relativistic
effects may be conveniently treated as first-order perturbations.

A good amount of work in this direction was done, shortly after the
appearance of general relativity, with deep insight and high competence by
the late Professor De Sitter.¹

The simple case of two bodies of comparable masses lies beyond De Sitter’s
developments, which were chiefly directed towards the inclusion of perturbations
arising from relativity in the standard equations concerning planets and
satellites of our solar system, where one of the masses predominates.

I have recently taken up the question,² paying due attention to the case
of comparable masses. For the usual two-body problem, which in the traditional
hierarchy comes immediately after Einstein’s one-centre problem, the
equations of motion are certainly integrable if one treats relativistic effects as
first-order perturbations.

We intend to say a few words about the deduction and illustration of two
inequalities which are already apparent or, at least, may shortly appear in the
observable field.


2. Let us start from the explicit form of the two Lagrangian functions

$L_0$ and $L_1$

which define the absolute motion of the centres of mass, $P_0$ and $P_1$, of two
celestial bodies; e. g., a double star.
I suppose that everything has already been reduced to ordinary space, $x_h^i$
($i = 1, 2, 3$) being Cartesian coördinates of $P_h$ ($h = 0, 1$) with reference to
some fixed or Galilean frame, while the independent variable is $x^0 = ct$ ($t$ usual
time and $c$ velocity of light).

* A paper delivered at the Tercentenary Conference of Arts and Sciences at Harvard
University, September 4, 1936. Received by the Editors January 18, 1937.

¹ Monthly Notices, Royal Astronomical Society, vol. 77 (1916), pp. 155-184 (Second
paper).

² “The relativistic problem of several bodies,” American Journal of Mathematics,
vol. 59 (1937), pp. 9-22.

The Lagrangian equations, furnished by the $L_h$ ($h = 0, 1$), define the
components

$\ddot{x}_h^i$ ($h = 0, 1$)

of the absolute accelerations of the two points $P_h$ as functions of positions and
velocities. At this stage, all textbooks introduce relative coördinates

$x^i = x_1^i - x_0^i$ ($i = 1, 2, 3$)

and corresponding relative accelerations

$\ddot{x}^i = \ddot{x}_1^i - \ddot{x}_0^i$ ($i = 1, 2, 3$)

simply by subtraction. Then, on account of the fact that all but the Newtonian
terms are of the second order, we are allowed to use Keplerian values,
and especially to employ the classical integrals of energy and areas.

If the simple but rather tedious developments are performed with some
insight into the matter, we recognize that relative motion may also be brought
under the Lagrangian scheme. The corresponding function $L$ is

(1) $L = N + \Pi$,

where, to within a constant factor, $N$ designates the usual Newtonian term
and $\Pi$ the additional relativistic contribution. More precisely, if $m_0$, $m_1$ are
the masses of the two bodies and $r = P_0P_1$ is their mutual distance, then
putting

(2) $m = m_0 + m_1$

and

(3) $\beta_i = \dfrac{dx^i}{dx^0}$, $\beta^2 = \sum_{i=1}^{3} \beta_i^2$, $\gamma = \dfrac{fm}{c^2}\,\dfrac{1}{r}$,

we obviously have

(4) $N = \tfrac{1}{2}\beta^2 + \gamma$.


In the expression of $\Pi$ we shall denote by $\epsilon$ the difference

$\epsilon = \tfrac{1}{2}\beta^2 - \gamma$,

which, in Newtonian approximation, is nothing but the constant of energy
divided by $c^2$, so that, up to terms of second order, the numerical value of $\epsilon$
behaves like a constant. On the other hand, it is not the same to apply the
Lagrangian operator

$\dfrac{d}{dx^0}\,\dfrac{\partial}{\partial \beta_i} - \dfrac{\partial}{\partial x^i}$

to the constant $\epsilon$, which gives zero, as to the binomial $\tfrac{1}{2}\beta^2 - \gamma$, which gives
$\dot{\beta}_i + \partial\gamma/\partial x^i$, or, up to the first order, $2\,\partial\gamma/\partial x^i$. Therefore, if we include $\epsilon$
among the arguments by means of which the explicit expression of $\Pi$ is built,
it is necessary to state whether $\epsilon$ is to be treated, in performing Lagrangian
operations, as a genuine constant or as the difference $\tfrac{1}{2}\beta^2 - \gamma$, which, as far as
first approximation is concerned, has the same numerical value.

Attributing to $\epsilon$ at any moment the rôle of a simple constant, I have
obtained
1 
(5) (2— dp) (1 + + 2(—1 + + Gy’, 
where

(6) $p = \dfrac{m_0 m_1}{m^2}$,

and $1/\alpha$ is a further dimensionless constant, connected, within terms of higher
order, with the double areal velocity $C$ by the relation

(7) $\dfrac{1}{\alpha} = \dfrac{C}{c} \Big/ \dfrac{fm}{c^2}$.

Accordingly, in ordinary planetary pairs and double stars, $\alpha \sim \beta$, i.e.,
$\alpha$ has the same order of magnitude as $\beta$. In particular, for circular motion,
$\alpha$ is the constant value of $\beta$. This is verified at once by means of the
elementary relations

$\beta^2 = \dfrac{fm}{c^2}\,\dfrac{1}{a}$, $\beta = \dfrac{C}{c}\,\dfrac{1}{a}$,

in which $a$ means the radius or, more generally, the major semi-axis of the orbit.
Solving for $fm/c^2$ and $C/c$, and substituting in (7), we get simply

$\alpha = \beta$.

At any rate, from $\alpha \sim \beta$ and $\gamma \sim \beta^2$, it follows that

$\dfrac{\gamma^3}{\alpha^2} \sim \beta^4$,

showing that the last term of $\Pi$ is, like the others, of the second order.
Squaring (7) and remembering the classical relation

$C^2 = fma(1 - e^2)$ ($e$ = eccentricity),

we get

(7') $\alpha^2 = \dfrac{fm}{c^2}\,\dfrac{1}{a(1 - e^2)}$,

which will be used later on.


3. The expression (1) of $L$ has, in view of (3) and (5), the form

(8) $L = \tfrac{1}{2}\psi\beta^2 + \phi$,

where both $\psi$ and $\phi$ depend exclusively on the mutual distance $r$, since
(9) y—1+ (4—p)y, 

(10) (1+ + 2(—1 + 


The motion defined by the Lagrangian function $L$ admits the integral

(11) $\tfrac{1}{2}\psi\beta^2 - \phi = \epsilon^*$,

where the constant $\epsilon^*$ may differ from $\epsilon$ only in second order terms.
Now an equivalence theorem in analytical dynamics³ states that the
Lagrangian function

(12) $L_1 = \tfrac{1}{2}\beta^2 + \psi(\phi + \epsilon^*)$,

for which the motion admits the integral

(13) $\tfrac{1}{2}\beta^2 - \psi(\phi + \epsilon^*) = \text{const.}$,

gives rise, for the value 0 of the constant in the second member, to a family
of trajectories identical with those defined by (8) and the integral (11).

Therefore, as far as trajectories are concerned, our task is reduced to
characterizing those belonging to the Lagrangian function (12). The latter
corresponds to the motion of a free particle in ordinary space under a
conservative (even central) force having the force-function

$\Phi = \psi(\phi + \epsilon^*)$.

³ Cf., e. g., Levi-Civita and Amaldi, Lezioni di meccanica razionale, vol. II, (Bologna,
Zanichelli, 1927), pp. 514-515.

Omitting the additive constant $\epsilon^*$ and all terms of order higher than two,
and writing correspondingly $\epsilon$ instead of $\epsilon^*$ in all second order terms, we have,
by (9) and (10),



+ (2+ Spey + 3(1— 


Putting for a moment

$m^* = m\{1 + (2 + 3p)\epsilon\}$ and $\gamma^* = \gamma\{1 + (2 + 3p)\epsilon\}$,

we may write simply $\gamma^*$ instead of $\gamma$ in the second and third terms of $\Phi$.
Here, however, it is indifferent, up to the second order, whether one employs
$\gamma$ or $\gamma^*$. Hence, omitting asterisks, we may consider the trajectories of a
central force with the potential function

(I) + 
where, from (2), (3), (6) and (7),

(II) $m = m_0 + m_1$, $\gamma = \dfrac{fm}{c^2}\,\dfrac{1}{r}$, $p = \dfrac{m_0 m_1}{m^2}$, $\dfrac{1}{\alpha} = \dfrac{C}{c} \Big/ \dfrac{fm}{c^2}$.

Note that, in these formulae (owing to the described policy of first introducing
$m^*$ and $\gamma^*$ and then suppressing asterisks), $m_0$ and $m_1$ do not represent exactly
the ordinary masses of our two bodies (which had been introduced in 2),
but truly these masses, slightly altered by the constant factor

$1 + (2 + 3p)\epsilon$.

What essentially matters is that they, like their sum $m$, behave as constants
driving the motions to be now considered.

Obviously, the first term in (I) represents the Newtonian attraction, while
the other two (both of the second order) are the relativistic perturbations
consisting of central attractions; the one varying according to the inverse cube,
the other according to the inverse fourth power of the distance. For the
Einsteinian case of the one-centre problem, one has only to put $p = 0$, and (I) gives
the well known expression $3\gamma^2$ for the perturbative function.


4. Orbits described under central forces were thoroughly investigated in
the 18th and 19th centuries. Especially for orbits which may be regarded as
disturbed Keplerian ellipses, computations of apsidal angles and corresponding
precessions of perihelia may be obtained by elementary methods.
In this way, with first order accuracy, the angular precession (per revolution)
of the perihelion or, in the case of a double star, of the periastron, is found
to be

(III) $\sigma = \sigma_E = 6\pi\alpha^2$.

With the value (7') of $\alpha^2$, namely

$\alpha^2 = \dfrac{fm}{c^2}\,\dfrac{1}{a(1 - e^2)}$,

the expression $\sigma_E = 6\pi\alpha^2$ of $\sigma$ is exactly the precession predicted by Einstein
for an infinitesimal planet ($p = 0$) in the case of motion about a central mass
possessing the total mass $m$ of the binary system. Therefore, within the
required approximation, Einstein’s formula, first established for an infinitesimal
body in the (relativistic) field of a central mass $m$, is still valid for two
bodies of any masses $m_0$, $m_1$ whose sum is $m$.

I hope that, for some double stars, the precession of the periastron of the 
satellite star may be observed with sufficient accuracy to test the theoretical 
result, thus affording a new astronomical confirmation of Einstein’s gravita- 
tional theory. For the moment I can only draw attention to the matter. 
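
As a rough numerical illustration of formula (III), the following sketch evaluates $6\pi\,(fm/c^2)/(a(1-e^2))$ for Mercury, where the classical Einstein value of about 43 seconds of arc per century should be recovered. The function name and all numerical constants (modern values of $fm$ for the Sun, the velocity of light, and Mercury's orbital elements) are assumptions supplied for the illustration; none of them is taken from the paper.

```python
import math

def periastron_precession(fm, a, e, c=299_792_458.0):
    """Precession per revolution in radians, sigma = 6*pi*alpha^2,
    with alpha^2 = (f m / c^2) / (a (1 - e^2)) as in (7')."""
    alpha_sq = (fm / c**2) / (a * (1.0 - e**2))
    return 6.0 * math.pi * alpha_sq

# Assumed modern values (SI units), used only for this check.
FM_SUN = 1.32712440018e20    # f * m for the Sun, m^3 / s^2
A_MERCURY = 5.7909e10        # major semi-axis, m
E_MERCURY = 0.2056           # eccentricity
PERIOD_DAYS = 87.969         # period of revolution, days

sigma = periastron_precession(FM_SUN, A_MERCURY, E_MERCURY)
revolutions_per_century = 100 * 365.25 / PERIOD_DAYS
arcsec_per_century = math.degrees(sigma * revolutions_per_century) * 3600.0

print(f"precession: {arcsec_per_century:.1f} arcsec per century")
```

For a double star the same function applies with `fm` taken as $f$ times the total mass $m = m_0 + m_1$, since, according to the text, the mass ratio does not enter the result to this order.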


5. The above prediction refers to relative orbits. Another, and perhaps
more striking, theoretical deduction concerns the absolute motion in the sky
of any double star system.

It is well known that general relativity does not include, as a rigorous
law, the principle of reaction nor its most popular dynamical consequence,
concerning the motion of the center of mass in case of absence of external
forces. Accordingly, we can no longer rely upon the constancy of the absolute
velocity $\dot{G} = dG/dx^0$ of the centre of mass $G$ of a double star, but are, on the
other hand, enabled to infer the expression of its instantaneous (absolute)
acceleration by employing the relativistic treatment of the two-body problem.

The general idea is obvious. Having the Lagrangian equations of motion 
for the two bodies $P_0$ and $P_1$, equations obtained in my previous paper, we
deduce at once the vectors 6, and 6, of the absolute accelerations as functions 
of their relative positions, and (still) absolute velocities. 

The combination 


is precisely the velocity of $G$ when referred to $x^0$ as the time variable, i. e., the
ordinary velocity divided by $c$; while the acceleration of $G$, again referred
to $x^0$, is


(14) = — (mf, + mf.) , 


where the dot denotes $d/dx^0$.

In view of the classical mechanics, which always holds in the first
approximation, we may anticipate that the Newtonian terms in the right-hand
member of (14) disappear, so that there remains only the relativistic correction,
expressed, as before, in terms of relative positions and absolute velocities.
Now, if we consider actual double star motions, it is plainly permitted, within
the degree of approximation in which we are interested, to introduce Newtonian
values referring to Keplerian motions of negative energy.

In order to perform the computation along the lines indicated above, it
will be convenient to use relative coördinates of invariable direction and having
their origin in $P_0$, where $m_0$ is the principal star ($m_0 \geq m_1$), and to choose
the orthogonal trihedron $P_0x^1x^2x^3$ in its standard position: $P_0x^1$ towards the
periastron of the (undisturbed) elliptical orbit of $P_1$; $P_0x^2$ in the plane of this
orbit and rotated 90° in the sense of the motion; $P_0x^3$ forming a right-handed
trihedron with the preceding two.

First we recognize from the outlined formulae that the component $\dot{\alpha}_3 = 0$.
Therefore, the acceleration of the center of mass $G$ of a double star lies entirely
in the plane of (relative) orbit; we may also say that it lies in the common
plane of (absolute) orbits, described by $P_0$ and $P_1$ about $G$.

For the two components

$\dot{\alpha}_1 = \dfrac{d\alpha_1}{dx^0}$, $\dot{\alpha}_2 = \dfrac{d\alpha_2}{dx^0}$

in the orbital plane I have found


5) au d dy x 1. 
(15) — — +9 (C+ ar) }, 


where the factor

(16) $\delta = \dfrac{m_0 - m_1}{m}$

is proportional to the difference of the masses, while $\beta_i$ and $\gamma$ have the same
significance as in (3), $p$ being defined by (6), $\alpha$ by (7), while $\epsilon$ is the (negative)
total energy of the undisturbed Keplerian motion.


6. As already remarked, Keplerian values when used in the right-hand
members of (15) give sufficient accuracy. Then the explicit determination of
the two variable components of velocity, $\alpha_i(x^0)$, requires only quadratures,
easily performed by introducing the true anomaly $\theta$ instead of $x^0$ by means
of the relation

$r^2\,\dfrac{d\theta}{dx^0} = \dfrac{C}{c} = \dfrac{\sqrt{fma(1 - e^2)}}{c}$

and remembering that

$\gamma = \dfrac{fm}{c^2}\,\dfrac{1}{r} = \dfrac{fm}{c^2}\,\dfrac{1 + e\cos\theta}{a(1 - e^2)}$.


Periodical terms in $\theta$ correspond to small fluctuations in the components $\alpha_1$
and $\alpha_2$, fluctuations which are repeated during every revolution and certainly
remain within the limits of accuracy of observation in the case of all the
known double stars. Accordingly, the only interesting terms are the secular
terms, whose effects accumulate during the successive revolutions. Now we
must remember that the $\alpha_i$ mean components of velocity with respect to the
(Roemerian) time $x^0 = ct$. Therefore the components of the ordinary velocity
of $G$ are

$c\alpha_1$, $c\alpha_2$.

Denoting by $c\bar{\alpha}_1$ and $c\bar{\alpha}_2$ their secular parts in terms of $\theta$, we obtain finally,
in virtue of (7'),

The final conclusion is that the secular acceleration of the center of mass 
G of the double star is directed along the major axis towards the periastron 
of the principal star. The amount of this secular acceleration may be con- 
veniently expressed as the increase of velocity in Km/sec per revolution. 

To this end, we first introduce the mass of the Sun, $m_\odot$, and write

$\dfrac{fm}{c^2} = \dfrac{m}{m_\odot}\,\dfrac{fm_\odot}{c^2}$.

The second factor is a length, the so called gravitational radius $l_\odot$ of the Sun,
having a value of the order of magnitude of a Kilometer, or about 1,5 Km.
We may then write

$\dfrac{fm}{c^2} = \dfrac{m}{m_\odot}\;1{,}5$ Km.

On the other hand, the mean motion $n$ of the double star is $n = 2\pi/T$, where
$T$ is the period of revolution. Of course, $T$ refers to the unit of time used
previously in $f$ and $c$. Starting with the C. G. S. units, we have $T$ in seconds.
But, in data of double stars, $T$ is generally expressed in days (for spectroscopic
binaries) or in years (for visual binaries). Choosing the first case and writing
$T_d$ to avoid ambiguity, we have

$\dfrac{1}{T} = \dfrac{1}{T_d}\,\dfrac{1}{86164}$.

It follows then, putting $\theta = 2\pi$ in the preceding expression of $c\bar{\alpha}_1$, and
considering its absolute value $V$, that the increase $\Delta V$ of the velocity of $G$ during
a revolution is


(IV) $(\Delta V)_{\text{per revolution}} = 2\pi^2\, p\delta\, \dfrac{e}{(1 - e^2)^{3/2}}\, \dfrac{m}{m_\odot}\, \dfrac{1{,}5}{86164\; T_d}$ Km/sec.

The number of revolutions per day is $1/T_d$, and, per century, 100 · 365,25
times $1/T_d$. Since

$2\pi^2\, \dfrac{1{,}5 \cdot 100 \cdot 365{,}25}{86164} = 12{,}55$,

the increase of the velocity of the center of mass during a century is

(V) $(\Delta V)_{\text{in a century}} = 12{,}55\, p\delta\, \dfrac{e}{(1 - e^2)^{3/2}}\, \dfrac{m}{m_\odot}\, \dfrac{1}{T_d^2}$ Km/sec.
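
The two numerical constants entering (V) can be checked with a short computation. The sketch below assumes that the numerical factor is to be read as $2\pi^2 \cdot 1{,}5 \cdot 100 \cdot 365{,}25 / 86164$, a reading chosen because it reproduces the printed value 12,55; the modern values of $fm_\odot$ and $c$ are likewise assumptions and are not taken from the paper.

```python
import math

# Assumed modern constants (SI units), used only to check the rounded 1.5 Km.
FM_SUN = 1.32712440018e20     # f * m_sun, m^3 / s^2
C_LIGHT = 299_792_458.0       # velocity of light, m / s

# Gravitational radius of the Sun, f m_sun / c^2, in kilometres.
grav_radius_km = FM_SUN / C_LIGHT**2 / 1000.0

# Values as used in the text.
L_SUN_KM = 1.5                # rounded gravitational radius
SECONDS_PER_DAY = 86164.0     # number of seconds in a day adopted in the text
DAYS_PER_CENTURY = 100 * 365.25

# Numerical factor of (V), under the assumed reading described above.
coefficient = 2 * math.pi**2 * L_SUN_KM * DAYS_PER_CENTURY / SECONDS_PER_DAY

print(f"gravitational radius of the Sun: {grav_radius_km:.3f} km")
print(f"coefficient of (V): {coefficient:.2f}")
```

The first figure shows how close the rounded value 1,5 Km is to the computed one; the second agrees with the constant 12,55 of formula (V).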

7. Such a difference of velocity along the apsidal line, having a component
also in the line of sight, ought to be detectable eventually by spectroscopic
observation.

As far as numerical values are concerned, we recognize from (V) [or
from (IV)] that the most favorable circumstances are realized for double stars
having the following properties:


a) short period, i.e., stars very near each other, which strongly
influences $1/T_d^2$;

b) total mass $m$ large (or at least not too small) in comparison with the
mass of the Sun $m_\odot$;

c) pronounced eccentricity, on account of the factor $e/(1 - e^2)^{3/2}$;

d) comparable masses, but not nearly equal, owing to the factors

$p = \dfrac{m_0 m_1}{(m_0 + m_1)^2}$, $\delta = \dfrac{m_0 - m_1}{m_0 + m_1}$.

The best conditions for $p\delta$ are realized by mass-ratios

$x = \dfrac{m_1}{m_0 + m_1}$, $1 - x = \dfrac{m_0}{m_0 + m_1}$

for which the polynomial

$p\delta = x(1 - x)(1 - 2x)$

attains its maximum. This takes place for $x = \tfrac{1}{2}(1 - 3^{-1/2})$, or,
roughly, for two stars containing respectively 1/5 and 4/5 of the total
mass of the system. The corresponding value of $p\delta$ is about 0,1.
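
The location of the maximum in d) can be confirmed numerically. The sketch below evaluates the polynomial $p\delta = x(1-x)(1-2x)$ at the stated value $x = \tfrac{1}{2}(1 - 3^{-1/2})$ and compares it with a plain grid search over $0 \le x \le \tfrac{1}{2}$; the function name and the grid resolution are choices made for this illustration only.

```python
import math

def p_delta(x):
    """Product p*delta as a function of the mass fraction x = m_1 / (m_0 + m_1)."""
    return x * (1.0 - x) * (1.0 - 2.0 * x)

# Stationary point stated in the text: root of 1 - 6x + 6x^2 = 0 below 1/2.
x_star = 0.5 * (1.0 - 3.0 ** -0.5)

# Independent check by grid search on [0, 1/2] with step 1e-5.
grid = [i / 100000 for i in range(0, 50001)]
x_grid = max(grid, key=p_delta)

print(f"x* = {x_star:.4f}, p*delta at x* = {p_delta(x_star):.5f}")
print(f"grid maximum near x = {x_grid:.4f}")
```

The maximum value equals $1/(6\sqrt{3})$, which is the "about 0,1" of the text, and the mass fractions are close to 0,21 and 0,79.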


I am well aware of the astronomical observations which have shown that,
in general, the eccentricity $e$ decreases with $T_d$; so it will not be easy to find a
binary for which the two requirements a) and c) are equally well satisfied.
On the other hand, a) predominates in (V), $1/T_d$ appearing to the second
power; furthermore, b) is, in the main, in accordance with a).

As the visual binaries have, in general, long periods (some years), they
are not to be expected, on account of a), to be advantageous for testing the
formula (V). This formula requires, however, the knowledge of the masses
of the principal and the companion star. Accordingly, it will be advisable to
turn to the class of binaries for which photometric as well as spectrographic
observations are available. In order to consider, at least, one example with a
reasonable $(\Delta V)_{\text{in a century}}$, I have looked into Moore’s Tables of the Lick
Observatory,⁴ stopping at N° 28, b Persei, for which, unfortunately, only
spectroscopic data are certain.

The tabulated elements are

$T_d = 1{,}52$, $e = 0{,}22$, $m_0 = \dfrac{0{,}85}{\sin^3 i}\, m_\odot$, $m_1 = \dfrac{0{,}23}{\sin^3 i}\, m_\odot$,

$i$ being the hitherto unknown inclination of the orbital plane to the tangential
plane of the celestial sphere. Whatever $i$ may be, we have finally for b Persei

$\dfrac{m}{m_\odot} = \dfrac{1{,}08}{\sin^3 i}$,

while $p$ and $\delta$, involving only mass-ratio, are independent of $i$. Their product
has the value

$p\delta = 0{,}09622$.

Accordingly, formula (V) gives for b Persei

$(\Delta V)_{\text{in a century}} = 12{,}55 \cdot 0{,}09622 \cdot \dfrac{0{,}22}{(1 - 0{,}22^2)^{3/2}} \cdot \dfrac{1{,}08}{\sin^3 i} \cdot \dfrac{1}{(1{,}52)^2} = \dfrac{0{,}13}{\sin^3 i}$ Km/sec.

The presence of the (yet unknown) divisor $\sin^3 i$ is consistent with the
hope that $\Delta V$ may become appreciable much earlier than in a century; perhaps
even in a few years.
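
The arithmetic of the b Persei example can be retraced as follows. The sketch assumes that formula (V) has the form $12{,}55\; p\delta\; e(1-e^2)^{-3/2}\,(m/m_\odot)\,T_d^{-2}$, which is the combination of the factors enumerated under a) to d), and it leaves the unknown divisor $\sin^3 i$ aside, so that the result is the numerator of the final expression. The function name is hypothetical.

```python
def delta_v_century(m0, m1, e, period_days, coefficient=12.55):
    """Increase of velocity per century in Km/sec (to be divided by sin^3 i).

    m0, m1 are the two masses in units of the mass of the Sun, each still
    multiplied by sin^3 i, as obtained from spectroscopic data alone.
    """
    m = m0 + m1
    p = m0 * m1 / m**2          # factor (6)
    delta = (m0 - m1) / m       # factor (16)
    ecc_factor = e / (1.0 - e**2) ** 1.5
    return coefficient * p * delta * ecc_factor * m / period_days**2

# Elements quoted in the text for b Persei.
M0, M1 = 0.85, 0.23
ECC = 0.22
T_DAYS = 1.52

m_total = M0 + M1
p_delta = (M0 * M1 / m_total**2) * ((M0 - M1) / m_total)
dv = delta_v_century(M0, M1, ECC, T_DAYS)

print(f"p*delta = {p_delta:.5f}")
print(f"(Delta V) in a century = {dv:.2f} / sin^3 i  Km/sec")
```

The product $p\delta$ obtained from the two masses agrees with the printed 0,09622, which happens to lie very close to the maximum value of the polynomial discussed under d).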


UNIVERSITY OF ROME. 


⁴ Or rather to the Résumé, reported in Armellini’s Astronomia siderale, vol. II
(Bologna, Zanichelli, 1931), Appendix 3.


FINITE DEFORMATIONS OF AN ELASTIC SOLID.* 


By F. D. MurNAGHAN. 


Introduction. In the classical theory of elasticity a deformation (= strain) 
is termed infinitesimal when the space derivatives of the components of the 
displacement vector of an arbitrary particle of the medium are so small that 
their squares and products may be neglected. Many attempts have been made 
to extend the classical theory of infinitesimal strain to the case of finite strains 
i.e. strains in which the fundamental hypothesis which serves to define an 
infinitesimal strain is not legitimate. The more important of these are given 
in the references numbered 1 to 7 at the end of the present paper. A good 
summary with many references may be found in the address of Professor
Signorini (8) at the Palermo meeting (1935) of the Società Italiana per
il progresso delle scienze. In the case of a finite strain there are two essentially
different viewpoints which coalesce when the strain is infinitesimal: we may 
use as the independent variables in terms of which the strain is described either 

(a) the coördinates of a typical particle of the medium in the initial or
unstrained position or 

(b) the coordinates of a typical particle in the final or strained position. 
Adopting the terminology familiar in the corresponding situation in hydro- 
dynamics we refer to these as the Lagrangian and Eulerian viewpoints respec- 
tively. Most of the previous writers on the subject of finite strain have, 
probably for reasons of mathematical convenience, adopted the Lagrangian 
view-point but in the present paper (which is concerned with actual applica- 
tions of the theory) the Eulerian point of view is regarded as fundamentally 
more significant than the Lagrangian. In this connection the following 
quotation from the recent paper by Seth (7) is to the point: 


“Like the body-stress equations these (the strain components) 
should be referred to the actual position of a point P of the material 
in the strained condition, and not to the position of a point considered 
before strain. The importance of this point, overlooked by various 
authors, can not be exaggerated. Apparently Filon and Coker (9) 
were the first to notice it and to stress its importance.” 


* Received February 23, 1937.

In the classical theory one of the fundamental results (derived from the
principle of energy conservation) expresses the connection between stress and
strain as follows:

The stress tensor equals the gradient of the elastic-energy-density with
respect to the strain tensor.


This fundamental principle (which is the formulation of Hooke’s law in its 
most general form) merely states that, in a virtual displacement of the strained 
elastic medium, the virtual work of all the forces, both surface and body, acting 
upon the medium may be obtained by integrating over the medium the scalar 
product of the stress tensor by the variation of the strain tensor. We show
in the present paper that this principle is merely an approximation which, 
whilst valid in the infinitesimal theory, is not valid in the finite theory. The 
exact principle is that the virtual work is obtained by integrating over the 
medium the scalar product of the stress tensor by the space-derivative of the
virtual displacement vector and it is only in the infinitesimal theory that one 
may with propriety equate the variation of the strain tensor to the space 
derivative of the virtual displacement vector. It is a fortunate circumstance, 
the demonstration of which is the raison d’étre of the present paper, that the 
exact equations, valid for any deformation, are sufficiently simple, at least in 
the case of an isotropic solid, to be applied and to be compared with
experimental results. We apply them to Bridgman’s experiments with solids and
liquids under high pressures (up to 20,000 atmospheres) and find remarkable 
agreement without introducing more than the two elastic constants of the 
infinitesimal theory. We also treat the Young’s modulus experiment and 
obtain at least a qualitative explanation of the yield point phenomenon which 
is not cared for in the classical theory. The mathematical treatment proceeds 
most naturally and simply when one uses the methods of tensor analysis. 
It is, however, not necessary to be especially familiar with these methods in 
order to understand the reasoning and we shall indicate at appropriate places 
how a non-tensor argument (called, for brevity, Cartesian) would proceed. 
In the interest of clarity (and probably also of brevity) it has seemed better 
to make the paper self-contained. 


1. The strain tensor and its variation. We are concerned with a three- 
dimensional medium (which we shall regard as a collection of particles) and 
with two positions of this medium to which we shall refer as the initial or 
unstrained and the final or strained position. A typical particle of the medium 
will have initial and final coördinates. In the classical theory it has been
usual to employ the same reference frame (rectangular Cartesian) for both
the initial (unstrained) and final (strained) positions and to denote the initial
coördinates by (a, b, c) and the final coördinates (of the same particle) by
(x, y, z). For the treatment we propose here it is inconvenient to tie our hands


at the very beginning by the restrictive hypothesis that the same coördinate
reference frame will be used to describe both positions of the medium; the
advantage of not making this hypothesis being that it is then possible to make,
or contemplate making, transformations of the final coördinates without thereby
enforcing a change of the initial coördinates. In the language of tensor
analysis the initial coördinates will be invariants or scalars under transformations
of the final coördinates. In order to emphasize this fact in the symbolism
we shall methodically write the labels necessary to distinguish from
one another the various members of a set of scalar quantities to the left of the
letter which is the symbol for the set; reserving, as is usual, the right of a
letter for the labels which are necessary to distinguish from one another the
various components of a tensor of which the letter is the symbol. Thus we
shall denote the initial coördinates of a typical particle of the medium by ${}^r a$
and the final coördinates of the same particle by $x^s$ (the labels $r$ and $s$ running
independently over the range 1, 2, 3). The initial coördinates ${}^r a$, as well as
the final coördinates $x^s$, are, independently of each other, any sets of coördinates
which we may find convenient; e. g. the initial coördinates may be rectangular
Cartesian and the final coördinates space polar. We make the usual assumption
that either coördinate system is differentiable (with continuous first derivatives)
with respect to the other and we adopt the notations


${}^r a_{,s} = \partial\,{}^r a / \partial x^s$; ${}_s x^r = \partial x^r / \partial\,{}^s a$

for the first order partial derivatives. As the notation implies ${}^r a_{,s}$ is, for fixed $r$
and varying $s$, a covariant vector, namely the gradient of the scalar function ${}^r a$
whilst ${}_s x^r$ is, for fixed $s$ and varying $r$, a contravariant vector (which furnishes
the direction of the coördinate line along which ${}^s a$ varies, the other two $a$’s
being held constant). The fundamental reciprocal nature of these two vectors
is described by the formulae

$({}^r a_{,\alpha})({}_s x^\alpha) = {}^r_s\delta$; $({}_\alpha x^r)({}^\alpha a_{,s}) = \delta_s^r$.

In these formulae we follow the standard convention of tensor analysis according
to which a repeated label (in this paper always taken from the Greek
alphabet) occurring once above and once below indicates summation with
respect to that label over the range 1, 2, 3; and ${}^r_s\delta$, $\delta_s^r$ each have the value unity
when $r = s$ and the value zero otherwise. As the notation implies ${}^r_s\delta$ is a set
of 9 scalar functions whilst $\delta_s^r$ are the 9 components of a single mixed tensor.

The initial and final squared elements of arc length will be given by
formulae of the type:

$(ds_0)^2 = {}_{\alpha\beta}c\,(d\,{}^\alpha a)(d\,{}^\beta a)$; $(ds)^2 = g_{\alpha\beta}\,dx^\alpha dx^\beta$

which are induced by the postulated underlying Euclidean metric of the space
in which our medium is being deformed. For example if both coördinate
systems are rectangular Cartesian

$(ds_0)^2 = (da)^2 + (db)^2 + (dc)^2$; $(ds)^2 = (dx)^2 + (dy)^2 + (dz)^2$

whilst if the initial coördinate system is rectangular Cartesian and the final
space polar

$(ds_0)^2 = (da)^2 + (db)^2 + (dc)^2$; $(ds)^2 = (dr)^2 + r^2(d\theta)^2 + r^2\sin^2\theta\,(d\phi)^2$.


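On replacing $d\,{}^\alpha a$ by its equivalent: $d({}^\alpha a) = {}^\alpha a_{,\sigma}\,dx^\sigma$ we may express $(ds_0)^2$
in terms of the differentials of the final coördinates $x^r$:

$(ds_0)^2 = {}_{\alpha\beta}c\,(d\,{}^\alpha a)(d\,{}^\beta a) = h_{\sigma\tau}\,dx^\sigma dx^\tau$
where
$h_{pq} = {}_{\alpha\beta}c\,({}^\alpha a_{,p})({}^\beta a_{,q})$.

In a similar manner we may express $(ds)^2$ in terms of the differentials of the
initial coördinates ${}^r a$:

$(ds)^2 = g_{\alpha\beta}\,dx^\alpha dx^\beta = {}_{\alpha\beta}k\,(d\,{}^\alpha a)(d\,{}^\beta a)$
where
${}_{pq}k = g_{\alpha\beta}\,({}_p x^\alpha)({}_q x^\beta)$.

We obtain thus two equivalent expressions for the difference $(ds)^2 - (ds_0)^2$
of the initial and final squared elements of arc length:

$(ds)^2 - (ds_0)^2 = 2\epsilon_{\alpha\beta}\,dx^\alpha dx^\beta = 2\,{}_{\alpha\beta}\eta\,(d\,{}^\alpha a)(d\,{}^\beta a)$
where
$\epsilon_{pq} = \tfrac{1}{2}(g_{pq} - h_{pq})$; ${}_{pq}\eta = \tfrac{1}{2}({}_{pq}k - {}_{pq}c)$.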


For a displacement in which lengths are preserved the difference of the squared
elements of arc length is zero (identically in the differentials $d\,{}^r a$, or,
equivalently, the differentials $dx^r$) and so the quantities $\epsilon_{pq}$ and ${}_{pq}\eta$ are zero for such
a rigid displacement. In general we regard the quantities $\epsilon_{pq}$ or ${}_{pq}\eta$ as descriptive
of the strain or deformation and we thus have two methods of describing
the strain:

(a) the description, by means of the quantities $\epsilon_{pq}$, in which the final
coördinates $x^r$ are adopted as the independent variables in terms of which the
description is made and

(b) the description, by means of the quantities ${}_{pq}\eta$, in which the initial
coördinates ${}^r a$ are adopted as the independent variables in terms of which the
description is made.




In the terminology of the introduction these are, respectively, the Eulerian and
Lagrangian descriptions of the strain. We shall also refer to them as the
tensor and scalar descriptions, respectively; and shall call the quantities $\epsilon_{pq}$
the tensor strain-components and the quantities ${}_{pq}\eta$ the scalar strain-components.
The quantities $\epsilon_{pq}$ are the covariant components of the strain tensor; this tensor
may also be presented in mixed form, or in contravariant form, by means of
the formulae

$$\epsilon_s^r = g^{r\alpha}\epsilon_{\alpha s}; \qquad \epsilon^{rs} = g^{r\alpha}g^{s\beta}\epsilon_{\alpha\beta}$$

where $g^{rs}$ is the reciprocal of $g_{rs}$: $g^{r\alpha}g_{\alpha s} = \delta_s^r$. Similarly it is convenient
to introduce other (but equivalent) scalar descriptions of the strain by means
of the formulae:

$${}^{r}_{s}\eta = {}^{r\alpha}c\;{}_{\alpha s}\eta; \qquad {}^{rs}\eta = {}^{r\alpha}c\;{}^{s\beta}c\;{}_{\alpha\beta}\eta$$

where ${}^{rs}c$ is the reciprocal of ${}_{rs}c$: $({}^{r\alpha}c)({}_{\alpha s}c) = \delta_s^r$. In the technical language
of tensor analysis we may say that we use the tensor $g_{pq}$, and its reciprocal $g^{pq}$,
for stepping labels down and up upon tensor quantities; and the matrix ${}_{pq}c$,
and its reciprocal ${}^{pq}c$, for stepping labels down and up upon scalar sets.

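When rectangular Cartesian coördinates, relative to the same reference
frame, are used for both the initial and final positions the tensor strain
components take the form

$$\epsilon_{xx} = \tfrac12\Big\{1 - \Big(\frac{\partial a}{\partial x}\Big)^2 - \Big(\frac{\partial b}{\partial x}\Big)^2 - \Big(\frac{\partial c}{\partial x}\Big)^2\Big\}; \qquad \epsilon_{yz} = -\tfrac12\Big\{\frac{\partial a}{\partial y}\frac{\partial a}{\partial z} + \frac{\partial b}{\partial y}\frac{\partial b}{\partial z} + \frac{\partial c}{\partial y}\frac{\partial c}{\partial z}\Big\}$$

whilst the scalar strain components are given by

$${}_{aa}\eta = \tfrac12\Big\{\Big(\frac{\partial x}{\partial a}\Big)^2 + \Big(\frac{\partial y}{\partial a}\Big)^2 + \Big(\frac{\partial z}{\partial a}\Big)^2 - 1\Big\}; \qquad {}_{bc}\eta = \tfrac12\Big\{\frac{\partial x}{\partial b}\frac{\partial x}{\partial c} + \frac{\partial y}{\partial b}\frac{\partial y}{\partial c} + \frac{\partial z}{\partial b}\frac{\partial z}{\partial c}\Big\}.$$

On denoting the displacement vector $(x - a,\ y - b,\ z - c)$ by $(u, v, w)$ we have

$$\epsilon_{xx} = \frac{\partial u}{\partial x} - \tfrac12\Big\{\Big(\frac{\partial u}{\partial x}\Big)^2 + \Big(\frac{\partial v}{\partial x}\Big)^2 + \Big(\frac{\partial w}{\partial x}\Big)^2\Big\}$$

$$\epsilon_{yz} = \tfrac12\Big\{\frac{\partial v}{\partial z} + \frac{\partial w}{\partial y} - \frac{\partial u}{\partial y}\frac{\partial u}{\partial z} - \frac{\partial v}{\partial y}\frac{\partial v}{\partial z} - \frac{\partial w}{\partial y}\frac{\partial w}{\partial z}\Big\}$$

$${}_{aa}\eta = \frac{\partial u}{\partial a} + \tfrac12\Big\{\Big(\frac{\partial u}{\partial a}\Big)^2 + \Big(\frac{\partial v}{\partial a}\Big)^2 + \Big(\frac{\partial w}{\partial a}\Big)^2\Big\}$$

$${}_{bc}\eta = \tfrac12\Big\{\frac{\partial v}{\partial c} + \frac{\partial w}{\partial b} + \frac{\partial u}{\partial b}\frac{\partial u}{\partial c} + \frac{\partial v}{\partial b}\frac{\partial v}{\partial c} + \frac{\partial w}{\partial b}\frac{\partial w}{\partial c}\Big\}.$$
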


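In the classical infinitesimal theory the partial derivatives $\partial u/\partial x, \ldots, \partial u/\partial a, \ldots$
are regarded as infinitesimal and so we may put

$$\epsilon_{xx} = \frac{\partial u}{\partial x}; \quad \epsilon_{yz} = \tfrac12\Big(\frac{\partial v}{\partial z} + \frac{\partial w}{\partial y}\Big); \quad {}_{aa}\eta = \frac{\partial u}{\partial a}; \quad {}_{bc}\eta = \tfrac12\Big(\frac{\partial v}{\partial c} + \frac{\partial w}{\partial b}\Big).$$

Since

$$\frac{\partial u}{\partial a} = \frac{\partial u}{\partial x}\frac{\partial x}{\partial a} + \frac{\partial u}{\partial y}\frac{\partial y}{\partial a} + \frac{\partial u}{\partial z}\frac{\partial z}{\partial a} = \frac{\partial u}{\partial x}\Big(1 + \frac{\partial u}{\partial a}\Big) + \frac{\partial u}{\partial y}\frac{\partial v}{\partial a} + \frac{\partial u}{\partial z}\frac{\partial w}{\partial a}$$

we have, to the degree of approximation contemplated by the infinitesimal theory,
$\partial u/\partial a = \partial u/\partial x$ etc., and there is no distinction between the tensor strain
components and the scalar strain components.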
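The coincidence of the two descriptions for small strains, and their divergence for finite strains, can be illustrated by a minimal numerical sketch. It assumes the simplest possible deformation, a uniform stretch $x = (1 + k)\,a$ along one Cartesian axis; the function name `stretch_strains` and the parameter `k` are illustrative choices and do not appear in the paper.

```python
def stretch_strains(k):
    """Strains for the uniform stretch x = (1 + k) * a along one Cartesian axis.

    Returns (eulerian, lagrangian, infinitesimal), where
      eulerian      = eps_xx = (1/2) * (1 - (da/dx)**2)   (tensor description)
      lagrangian    = eta_aa = (1/2) * ((dx/da)**2 - 1)   (scalar description)
      infinitesimal = du/da = k                           (classical first-order value)
    """
    dx_da = 1.0 + k        # derivative of final w.r.t. initial coordinate
    da_dx = 1.0 / dx_da    # derivative of initial w.r.t. final coordinate
    eulerian = 0.5 * (1.0 - da_dx ** 2)
    lagrangian = 0.5 * (dx_da ** 2 - 1.0)
    return eulerian, lagrangian, k


if __name__ == "__main__":
    for k in (1e-4, 1e-2, 0.5):
        eps, eta, small = stretch_strains(k)
        print(f"k={k:g}  eps_xx={eps:.6f}  eta_aa={eta:.6f}  du/da={small:.6f}")
```

For a very small stretch the three numbers agree to first order in `k`, whereas for `k = 0.5` the tensor and scalar components differ by more than a factor of two.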


Relation between the elements of volume. A region occupied by the
medium in its initial position is described by setting the coördinates ${}^{r}a$
functions of some convenient three independent variables and then the initial
element of volume $dV_0$ is given by the formula $dV_0 = \sqrt{c}\,|d(a)|$ where $c$
denotes the determinant of the matrix ${}_{pq}c$ and $|d(a)|$ denotes the numerical
value of the product of the differentials of the independent variables by the
Jacobian determinant of the three coördinates ${}^{r}a$ relative to the independent
variables. Similarly $dV$, the element of volume occupied by the same particles
when in the strained position, is given by the formula $dV = \sqrt{g}\,|d(x)|$ where
$g$ denotes the determinant of the tensor $g_{rs}$. Now the relation

$${}_{\alpha\beta}c\,(d\,{}^{\alpha}a)(d\,{}^{\beta}a) = (ds_0)^2 = h_{\alpha\beta}\,dx^\alpha dx^\beta$$
implies

$$\sqrt{c}\,|d(a)| = \sqrt{h}\,|d(x)|$$

where $h$ denotes the determinant of the tensor $h_{rs}$. On writing this tensor in
its mixed form: $h_{rs} = g_{r\alpha}h_s^{\alpha}$ and using the theorem that the determinant of
the product of two matrices equals the product of the determinants of the two
factors we find $h = g\,\det(h_s^r)$ and so

$$dV_0/dV = \sqrt{c}\,|d(a)|\,/\,\sqrt{g}\,|d(x)| = \sqrt{h}\,|d(x)|\,/\,\sqrt{g}\,|d(x)| = \sqrt{h/g} = \sqrt{\det(h_s^r)}.$$

On writing the relation $h_{pq} = g_{pq} - 2\epsilon_{pq}$ in its mixed form $h_p^q = \delta_p^q - 2\epsilon_p^q$
we find $\det(h_s^r) = 1 - 2I_1 + 4I_2 - 8I_3$ where

$$I_1 = \epsilon_\alpha^\alpha; \qquad I_2 = \tfrac12\{\epsilon_\alpha^\alpha\epsilon_\beta^\beta - \epsilon_\beta^\alpha\epsilon_\alpha^\beta\}; \qquad I_3 = \det(\epsilon_s^r)$$




are the three strain invariants. They are, respectively, the sum of the diagonal
elements, the sum of the principal two rowed minors, and the determinant of
the matrix $\epsilon_s^r$ which presents the strain tensor in mixed form. When the
strain is homogeneous i.e. does not vary from point to point of the medium
we may replace the ratio $dV_0/dV$ by the ratio $V_0/V$ where $V_0$ is the initial
volume of any portion of the medium and $V$ is the volume occupied in the
strained position, by the particles which initially occupied the volume $V_0$.
We have, then, for a homogeneous strain the relation

$$V_0/V = \sqrt{\det(\delta_s^r - 2\epsilon_s^r)} = \sqrt{1 - 2I_1 + 4I_2 - 8I_3}.$$
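The determinant identity behind this relation can be checked numerically. The following is a minimal sketch, assuming rectangular Cartesian coördinates (so that the mixed strain components form an ordinary symmetric 3 x 3 matrix); the helper names `det3`, `strain_invariants`, `volume_ratio` and the sample matrix are illustrative and are not taken from the paper.

```python
import math


def det3(m):
    """Determinant of a 3 x 3 matrix given as a list of three rows."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def strain_invariants(e):
    """Return (I1, I2, I3): trace, sum of principal 2x2 minors, determinant."""
    i1 = e[0][0] + e[1][1] + e[2][2]
    i2 = (e[0][0] * e[1][1] - e[0][1] * e[1][0]
          + e[0][0] * e[2][2] - e[0][2] * e[2][0]
          + e[1][1] * e[2][2] - e[1][2] * e[2][1])
    i3 = det3(e)
    return i1, i2, i3


def volume_ratio(e):
    """V0 / V = sqrt(det(delta - 2 * eps)) for a homogeneous strain."""
    h = [[(1.0 if r == s else 0.0) - 2.0 * e[r][s] for s in range(3)]
         for r in range(3)]
    return math.sqrt(det3(h))


# Illustrative (made-up) symmetric strain matrix.
SAMPLE = [[0.010, 0.002, 0.000],
          [0.002, -0.020, 0.003],
          [0.000, 0.003, 0.015]]

if __name__ == "__main__":
    i1, i2, i3 = strain_invariants(SAMPLE)
    print(volume_ratio(SAMPLE) ** 2, 1 - 2 * i1 + 4 * i2 - 8 * i3)
```

The two printed numbers agree to rounding error, since $\det(\delta - t\epsilon) = 1 - tI_1 + t^2I_2 - t^3I_3$ holds for any 3 x 3 matrix and the relation above is the case $t = 2$.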


For the special case of a homogeneous strain which is also at each point
isotropic (which will be the case in an isotropic medium subjected to uniform
hydrostatic pressure) the strain tensor will be a scalar tensor: $\epsilon_s^r = \epsilon\,\delta_s^r$ and
we have the relatively simple relation

$$V_0/V = (1 - 2\epsilon)^{3/2}.$$

For the classical infinitesimal theory this reduces, as is at once seen on writing
$(1 - 2\epsilon)^{3/2} = 1 - 3\epsilon + \cdots$, to $V_0/V = 1 - 3\epsilon$ or

$$\epsilon = \frac13\,\frac{V - V_0}{V} = \frac13\,\frac{\Delta V}{V}$$

(to the degree of approximation contemplated by the theory). The exact
relation, valid for a finite strain, to which the relation just written is an
approximation is

$$\epsilon = \tfrac12\{1 - (V_0/V)^{2/3}\}.$$
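As a hedged illustration of how far the exact and the linearized relations differ, the sketch below evaluates both for a given volume ratio. The function names are illustrative; the sample ratio corresponds to a compression $(V_0 - V)/V_0 = .0295$, the figure quoted for sodium at 2,000 atm. in Table 1 of section 4.

```python
def isotropic_strain_exact(v0_over_v):
    """Exact relation: eps = (1/2) * (1 - (V0/V)**(2/3))."""
    return 0.5 * (1.0 - v0_over_v ** (2.0 / 3.0))


def isotropic_strain_linear(v0_over_v):
    """Infinitesimal theory: V0/V = 1 - 3*eps, i.e. eps = (1 - V0/V) / 3."""
    return (1.0 - v0_over_v) / 3.0


if __name__ == "__main__":
    ratio = 1.0 / (1.0 - 0.0295)   # V0/V for a 2.95 per cent compression
    print(isotropic_strain_exact(ratio), isotropic_strain_linear(ratio))
```

Both values are negative (a compression), and the exact value has magnitude close to .0101, in agreement with the entry $f = .0101$ of Table 1, where $f = -\epsilon$.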


The mathematical description of a homogeneous strain is that the strain tensor
should be constant in the tensor sense i.e. its absolute or covariant derivative vanishes.
This implies that the absolute derivatives of the invariants $I_1, I_2, I_3$ (or what is the
same thing, since these are scalars, their space derivatives) vanish so that $dV_0/dV$ is a
numerical constant. The hypothesis $\epsilon_{pq,r} = 0$ (where as is usual labels following a
comma indicate covariant differentiation) implies $h_{pq,r} = 0$ and since $h_{pq} = {}_{\alpha\beta}c\,({}^{\alpha}a_{,p})({}^{\beta}a_{,q})$
an easy calculation yields ${}^{r}a_{,pq} + \{{}^{\;r}_{\alpha\beta}\}\,({}^{\alpha}a_{,p})({}^{\beta}a_{,q}) = 0$, the Christoffel symbols
being those formed from the matrix ${}_{pq}c$. In particular when the $a$'s and
$x$'s are rectangular Cartesian coördinates so that ${}^{r}a_{,pq} = \partial^2({}^{r}a)/\partial x^p\,\partial x^q$ and the
Christoffel symbols vanish, the ${}^{r}a$ must be linear functions of the $x^r$. Conversely if this
is the case the strain components are numerical constants and the strain is homogeneous.


The variation of the strain tensor. In order to introduce the concept of
virtual work and thereby express the conditions for equilibrium of the strained
medium we must adopt the dynamic as opposed to the static viewpoint. In
other words instead of regarding the strained position of the medium as
something fixed and final we must regard it as capable of variation. To do this
conveniently, from the mathematical viewpoint, we conceive of the final
coördinates $x^r$ as depending not only upon the initial coördinates ${}^{r}a$ but also
upon an accessory parameter $\theta$ (which could, in hydrodynamics, conveniently
be taken as the time variable). We shall denote differentials with respect to
the parameter $\theta$ by the symbol $D$:

$$Dx^r = \frac{\partial x^r}{\partial\theta}\,d\theta$$

it being understood that in the partial differentiation with respect to $\theta$ the
coördinates ${}^{r}a$ are kept constant (i.e. the $D$ denotes the substantial or particle
differentiation of hydrodynamics). If we have any tensor function $f^{\cdots}_{\cdots}$ of the
coördinates $x^r$ we shall denote by $\delta f^{\cdots}_{\cdots}$ the tensor of the same type defined
by the rule

When the coördinates $x^r$ are Cartesian covariant differentiation is merely
ordinary space differentiation of the tensor components and so $\delta f^{\cdots}_{\cdots} = Df^{\cdots}_{\cdots}$.
Even when the coördinates are not Cartesian this relation holds for any scalar
function: $\delta f = Df$ since $f_{,r} = \partial f/\partial x^r$. We effect a small economy of notation
by defining $\delta x^r$ by the relation $\delta x^r = Dx^r$ and we refer to the contravariant
vector $\delta x^r$ as the virtual displacement vector. Before proceeding to the
necessary calculation of the variation of the strain tensor and of its scalar
components it must be clearly understood that the variation of any scalar function
of the ${}^{r}a$ is zero; e.g. $\delta\,{}_{pq}c = 0$. This result is a trivial consequence of the
definition of the $\delta$ symbol for in the differentiation with respect to $\theta$ the
coördinates ${}^{r}a$ are held constant.

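Owing to the independence of the variables ${}^{r}a$ and $\theta$ differentiations with
respect to them are interchangeable as to order (it being supposed that the
second order derivatives involved exist and are continuous). Hence

$$\frac{\partial}{\partial\theta}\Big(\frac{\partial x^r}{\partial\,{}^{s}a}\Big) = \frac{\partial}{\partial\,{}^{s}a}\Big(\frac{\partial x^r}{\partial\theta}\Big)$$

and on multiplication by $d\,{}^{s}a$ (which is independent of $\theta$) and summation with
respect to $s$ we find

$$\frac{\partial}{\partial\theta}(dx^r) = d\Big(\frac{\partial x^r}{\partial\theta}\Big).$$

This implies the tensor equation

$$\delta(dx^r) = (\delta x^r)_{,\alpha}\,dx^\alpha$$

since it is the form to which the tensor equation reduces when the coördinates
$x^r$ are Cartesian. Since the variation of the metrical tensor is zero: $\delta g_{rs} = 0$,
the tensor equation just written may be put in the equivalent form

$$\delta(dx_r) = (\delta x_r)_{,\alpha}\,dx^\alpha.$$

Since $d({}^{r}a) = {}^{r}a_{,\alpha}\,dx^\alpha$ and $\delta\,d({}^{r}a) = 0$ we have

$$\delta({}^{r}a_{,\alpha})\,dx^\alpha = -\,{}^{r}a_{,\alpha}\,\delta(dx^\alpha) = -\,{}^{r}a_{,\alpha}\,(\delta x^\alpha)_{,\beta}\,dx^\beta$$

and since this must hold for arbitrary $dx^r$ we must have

$$\delta({}^{r}a_{,q}) = -\,{}^{r}a_{,\alpha}\,(\delta x^\alpha)_{,q}.$$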


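From this expression and the relation $h_{pq} = {}_{\alpha\beta}c\,({}^{\alpha}a_{,p})({}^{\beta}a_{,q})$ we can read off
at once the formula for the variation of $h_{pq}$ (or, what is the same thing, of
$2\epsilon_{pq}$). We find

$$\delta h_{pq} = {}_{\alpha\beta}c\,\{\delta({}^{\alpha}a_{,p})\,{}^{\beta}a_{,q} + {}^{\alpha}a_{,p}\,\delta({}^{\beta}a_{,q})\}$$
$$= -\,{}_{\alpha\beta}c\,\{{}^{\alpha}a_{,\gamma}\,(\delta x^\gamma)_{,p}\,{}^{\beta}a_{,q} + {}^{\alpha}a_{,p}\,{}^{\beta}a_{,\gamma}\,(\delta x^\gamma)_{,q}\}$$
$$= -\,h_{\gamma q}\,(\delta x^\gamma)_{,p} - h_{p\gamma}\,(\delta x^\gamma)_{,q}$$
$$= -\,h_q^\gamma\,(\delta x_\gamma)_{,p} - h_p^\gamma\,(\delta x_\gamma)_{,q}.$$

This will be seen to be the significant formula for our later purpose. On
writing $h_{pq} = g_{pq} - 2\epsilon_{pq}$ it may be written in the equivalent form

$$\delta\epsilon_{pq} = \tfrac12\{(\delta x_q)_{,p} + (\delta x_p)_{,q}\} - \{\epsilon_q^\gamma\,(\delta x_\gamma)_{,p} + \epsilon_p^\gamma\,(\delta x_\gamma)_{,q}\}.$$

For the classical infinitesimal theory it is allowable to write

$$\delta\epsilon_{pq} = \tfrac12\{(\delta x_q)_{,p} + (\delta x_p)_{,q}\}$$

and it is the difference between this and the exact expression just furnished
that makes it incorrect, as stated in the introduction, to write the stress tensor
as the gradient with respect to the strain tensor of the elastic energy density.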


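Criterion for a rigid virtual displacement. A virtual displacement is
said to be rigid when $\delta(ds^2) = 0$. Since $\delta(ds_0)^2 = 0$ and

$$(ds)^2 - (ds_0)^2 = 2\epsilon_{\alpha\beta}\,dx^\alpha dx^\beta$$

an equivalent description of a rigid virtual displacement is

$$\delta(\epsilon_{\alpha\beta}\,dx^\alpha dx^\beta) = 0.$$

One must be on one's guard against the error of supposing that (because in a
rigid displacement $\epsilon_{pq} = 0$) in a virtual rigid displacement the strain tensor
$\epsilon_{pq}$ is constant: $\delta\epsilon_{pq} = 0$. The evident fallacy in such a guess is the neglect
of terms involving $\delta(dx^r)$, which quantities are not in general zero in a rigid
displacement. On the other hand the scalar components ${}_{pq}\eta$ of the strain
tensor are constant in a rigid virtual displacement; for the criterion for a rigid
virtual displacement may be put in the form $\delta({}_{\alpha\beta}\eta\;d\,{}^{\alpha}a\;d\,{}^{\beta}a) = 0$ and this is
equivalent to $\delta\,{}_{\alpha\beta}\eta\,(d\,{}^{\alpha}a)(d\,{}^{\beta}a) = 0$ since $d\,{}^{r}a$ is independent of $\theta$. Since this
equation must hold for arbitrary $d\,{}^{r}a$ we must have $\delta\,{}_{pq}\eta = 0$ and this necessary
condition is clearly sufficient. For any virtual displacement, rigid or not,
we have

$$\delta(ds)^2 = \delta(g_{\alpha\beta}\,dx^\alpha dx^\beta) = \delta(dx^\alpha\,dx_\alpha) = \{(\delta x_\alpha)_{,\beta} + (\delta x_\beta)_{,\alpha}\}\,dx^\alpha dx^\beta$$

so that the criterion for a rigid virtual displacement may be put in the form
$(\delta x_p)_{,q} + (\delta x_q)_{,p} = 0$ (equations of Killing). Amongst the possible rigid
virtual displacements are those for which $(\delta x_p)_{,q} = 0$; these are the virtual
translations which appear, when the coördinates $x^r$ are Cartesian, in the form
$\delta x_p$ = constant. Since $\delta(ds)^2 = 2\,\delta({}_{\alpha\beta}\eta\;d\,{}^{\alpha}a\;d\,{}^{\beta}a)$ we have, for any virtual
displacement whatsoever,

$$2\,\delta\,{}_{\alpha\beta}\eta\;d\,{}^{\alpha}a\;d\,{}^{\beta}a = \{(\delta x_\alpha)_{,\beta} + (\delta x_\beta)_{,\alpha}\}\,dx^\alpha dx^\beta$$
implying
$$\delta\,{}_{pq}\eta = \tfrac12\{(\delta x_\alpha)_{,\beta} + (\delta x_\beta)_{,\alpha}\}\,\frac{\partial x^\alpha}{\partial\,{}^{p}a}\,\frac{\partial x^\beta}{\partial\,{}^{q}a}.$$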


2. The stress-tensor and the virtual work of the applied forces acting
on the medium. Let us consider any portion of our elastic medium which is
bounded by a closed surface and denote by $S_0$ and $S$ the initial and final
positions, respectively, of this bounding surface. The coördinates ${}^{r}a$ of the
initial position of any particle of this bounding surface (and equally the
coördinates $x^r$ of the final position of such a particle) are functions of two
independent parameters and the surface element of $S$ may be described by
means of the covariant vector $dS_r = \sqrt{g}\;d(x^p, x^q)$ (where $p, q, r$ is an even
permutation of the natural order 1, 2, 3 and $d(x^p, x^q)$ denotes the product
of the differentials of the two independent variables times the Jacobian of the
two coördinates $x^p$ and $x^q$ relative to these independent variables). When the
coördinates $x^r$ are rectangular Cartesian

$$dS_x = d(y, z); \qquad dS_y = d(z, x); \qquad dS_z = d(x, y).$$

Denoting by $dS$ the magnitude of this typical surface element, so that

$$(dS)^2 = dS_\alpha\,dS^\alpha,$$

the stress tensor $T^{rs}$ is a linear vector function which associates with each
surface element $dS_r$ a stress vector $F^r$ by means of the formula $F^r\,dS = T^{r\alpha}\,dS_\alpha$.
The virtual work of the stresses across the boundary $S$ is accordingly

$$\int_S (F^\beta\,dS)\,\delta x_\beta = \int_S T^{\beta\alpha}\,\delta x_\beta\,dS_\alpha$$

and this is equivalent to the volume integral

$$\int_V (T^{\beta\alpha}\,\delta x_\beta)_{,\alpha}\,dV$$

extended over the volume $V$ bounded by $S$. If there are mass forces ($M^r$ per
unit mass) acting on the medium the virtual work of these is $\int_V \rho M^\beta\,\delta x_\beta\,dV$,
where $\rho$ is the mass density, and so the virtual work of all the forces acting
on any portion of the medium is

$$\int_V \{(T^{\beta\alpha}\,\delta x_\beta)_{,\alpha} + \rho M^\beta\,\delta x_\beta\}\,dV.$$

We now make the physical assumption (criterion of equilibrium) that this
virtual work is zero for any rigid virtual displacement. Amongst these rigid
virtual displacements are the translations which are characterized by the tensor
equation $(\delta x_p)_{,q} = 0$ and so we must have

$$\int_V (T^{\beta\alpha}{}_{,\alpha} + \rho M^\beta)\,\delta x_\beta\,dV = 0;$$

since $\delta x_r$ may be assigned arbitrarily at a given point and since the volume $V$
of integration is arbitrary this forces

$$T^{r\alpha}{}_{,\alpha} + \rho M^r = 0.$$

Consequently the virtual work of all the forces (mass and surface) acting upon
any portion of the medium in any virtual displacement whatever is given by
the expression

$$\text{Virtual work} = \int_V T^{\beta\alpha}\,(\delta x_\beta)_{,\alpha}\,dV.$$

Furthermore since this must vanish for any rigid virtual displacement i.e. for
any virtual displacement for which $(\delta x_p)_{,q} + (\delta x_q)_{,p} = 0$ the stress tensor
must be symmetric: $T^{rs} = T^{sr}$. This relation enables us to write the
expression for the virtual work of all the forces acting upon the medium in any
virtual displacement whatever in the form:

$$\text{Virtual work} = \tfrac12\int_V T^{\alpha\beta}\,\{(\delta x_\alpha)_{,\beta} + (\delta x_\beta)_{,\alpha}\}\,dV.$$

In the classical infinitesimal theory this may be written as $\int_V T^{\alpha\beta}\,\delta\epsilon_{\alpha\beta}\,dV$ but
such an approximation is not legitimate in the finite theory.


3. The elastic potential and its connection with the stress tensor. We
now consider an element of volume $dV$ of the medium in its strained position
and denote by $\rho$ the density of the matter occupying the element of volume $dV$
so that the element of mass is $dm = \rho\,dV$. The principle of conservation of
mass is expressed by the formula:

$$\delta(dm) = \delta(\rho\,dV) = 0.$$

In order to apply the fundamental energy-conservation law of thermodynamics
we denote by $T$ the temperature of the element of mass $dm$; by $\sigma$ the entropy
density (per unit mass) so that the entropy of the mass $dm$ is $\sigma\,dm = \rho\sigma\,dV$;
and by $u\,dm$ the internal energy of the mass $dm$. Then the principle of
thermodynamics to which we have referred says that $T\,\delta(\sigma\,dm) = \delta(u\,dm)\ -$ Virtual
work of all forces acting on $dm$. On introducing the free-energy density
$\phi = u - T\sigma$ and availing ourselves of the principle of conservation of mass:
$\delta\,dm = 0$ we find, on integrating over any portion $V$ of the strained medium,

$$\int_V \delta\phi\;dm = \int_V T^{\alpha\beta}\,(\delta x_\alpha)_{,\beta}\,dV - \int_V \sigma\,dm\cdot\delta T.$$

On writing $dm = \rho\,dV$ and observing that this relation must hold for arbitrary
volumes $V$ we obtain the fundamental formula connecting the elastic potential
with the stress tensor

$$\rho\,\delta\phi = T^{\alpha\beta}\,(\delta x_\alpha)_{,\beta} - \rho\sigma\,\delta T.$$


In order to derive from this a connection between the stress tensor $T^{rs}$
and the strain tensor $\epsilon_{rs}$ we must make some hypothesis concerning the
function $\phi$. We shall assume that it depends only on the three gradient vectors
${}^{r}a_{,s}$ (or, equivalently, on the reciprocal set $\partial x^s/\partial\,{}^{r}a$) it being understood that either
set may appear both covariantly and contravariantly. In other words we shall
assume that $\phi$ is a function of the vectors ${}^{r}a_{,s}$, the metrical tensor $g_{rs}$, the
scalar quantities ${}_{pq}c$ and the temperature $T$. We shall confine our attention
in what follows to isothermal variations so that $T$ is a constant parameter in $\phi$
to which attention need not be explicitly directed. Then $\delta\phi$ must be zero in
any (isothermal) rigid virtual displacement; in other words

$$\frac{\partial\phi}{\partial({}^{\alpha}a_{,\beta})}\,\delta({}^{\alpha}a_{,\beta}) = 0$$

provided $(\delta x_p)_{,q} + (\delta x_q)_{,p} = 0$. On inserting the value $-({}^{r}a_{,\alpha})(\delta x^\alpha)_{,q}$ given
in 1 for $\delta({}^{r}a_{,q})$ we see that the tensor

$$\frac{\partial\phi}{\partial({}^{\alpha}a_{,q})}\,({}^{\alpha}a^{,p})$$

must be symmetric in $p$ and $q$ (${}^{r}a^{,p} = g^{p\alpha}({}^{r}a_{,\alpha})$). Hence the function $\phi$ of the nine variables ${}^{r}a_{,s}$
is conditioned by the three linear, homogeneous, first order, partial differential
equations:

$$\frac{\partial\phi}{\partial({}^{\alpha}a_{,q})}\,({}^{\alpha}a^{,p}) - \frac{\partial\phi}{\partial({}^{\alpha}a_{,p})}\,({}^{\alpha}a^{,q}) = 0.$$

These equations are readily seen to form a complete system (the commutator
of any two being the third) and so the general solution of them is a function
of $6 = 9 - 3$ independent solutions. Since $\delta\,{}_{pq}\eta = 0$ in any rigid virtual
displacement we know that the six functions ${}_{pq}\eta$ satisfy the three partial
differential equations just written. Hence $\phi$ can involve the vectors ${}^{r}a_{,s}$ only
through the scalar functions ${}_{pq}\eta$. The quantities $g_{rs}$ cannot
occur in the expression for $\phi$ since $\phi$ is a scalar and the only scalar functions of
the metrical tensor are numerical (e.g. $g^{\alpha\beta}g_{\alpha\beta} = \delta_\alpha^\alpha = 3$). Now under a
transformation of coördinates ${}^{r}a$ the quantities ${}_{pq}\eta$ (which are scalars under
transformation of coördinates $x^r$) transform as covariant tensors. We say that
the medium is isotropic when the elastic-energy density $\phi$ is unaffected by a
transformation of the coördinates ${}^{r}a$; for instance when the coördinates ${}^{r}a$ are
rectangular Cartesian an arbitrary rotation of the reference frame must leave
the function $\phi$ unaffected. In order that this may be the case $\phi$ must involve
the ${}_{pq}\eta$ only through the "invariants" (under transformation of the
coördinates ${}^{r}a$):

$$J_1 = {}^{\alpha}_{\alpha}\eta; \qquad J_2 = \tfrac12\{({}^{\alpha}_{\alpha}\eta)({}^{\beta}_{\beta}\eta) - ({}^{\alpha}_{\beta}\eta)({}^{\beta}_{\alpha}\eta)\}; \qquad J_3 = \det({}^{r}_{s}\eta).$$

These quantities $J_1, J_2, J_3$ are, respectively, the sum of the diagonal elements,
the sum of the principal two-rowed minors, and the determinant of the matrix
${}^{r}_{s}\eta$. Hence, for an isotropic medium: $\phi = \phi(J_1, J_2, J_3, T)$. We prove in the
appendix that the quantities $J_1, J_2, J_3$ are functions of the three invariants
$I_1, I_2, I_3$ of the strain tensor $\epsilon_s^r$ and so we may write, for an isotropic medium,

$$\phi = \phi(I_1, I_2, I_3, T)$$

i.e. $\phi$ is a function of the components of the strain tensor $\epsilon_s^r$ and $T$.
Conversely if we make the hypothesis that $\phi$ is a function of $T$ and of the strain
components (tensor) $\epsilon_s^r$ alone this implies that $\phi$ is isotropic; for the only
scalar functions of $\epsilon_s^r$ are functions of its invariants $I_1, I_2, I_3$. In order to
prevent possible misunderstanding of this remark [in view of the fact that in
the classical treatment of elasticity $\phi$ is taken for crystalline (i.e. non-isotropic)
media as a quadratic function of the strain components] we may say that it is
tacitly understood in this classical procedure that a special privileged reference
frame, determined by the axes of the crystal, has been chosen. The coefficients
of the quadratic form are accordingly not scalar quantities but constitute a
tensor which depends on the orientation of the crystalline axes.


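The fundamental stress-strain relations for an isotropic medium. As
we have just seen the elastic energy density $\phi$ is, for an isotropic medium,
a function of the tensor strain components $\epsilon_s^r$, involving these through the
strain invariants $I_1, I_2, I_3$. It will be, for the moment, more convenient to
regard $\phi$ as a function of the covariant strain components $\epsilon_{rs}$ ($\epsilon_s^r = g^{r\alpha}\epsilon_{\alpha s}$!)
and we shall, if necessary, symmetrize its formal expression; i.e. we shall
replace each $\epsilon_{rs}$, wherever it occurs in the expression for $\phi$, by its equivalent:
$\epsilon_{rs} = \tfrac12(\epsilon_{rs} + \epsilon_{sr})$. Denoting by $\partial\phi/\partial\epsilon_{rs}$ the partial derivative of $\phi$ with
respect to $\epsilon_{rs}$, all the other $\epsilon_{pq}$ (including $\epsilon_{sr}$) being held constant in the
differentiation (so that in this formal differentiation no attention is paid to the
symmetry relations $\epsilon_{rs} = \epsilon_{sr}$), it follows that $\partial\phi/\partial\epsilon_{rs} = \partial\phi/\partial\epsilon_{sr}$. We have seen
in 1 that

$$\delta\epsilon_{pq} = -\tfrac12\,\delta h_{pq} = \tfrac12\{h_q^\alpha\,(\delta x_\alpha)_{,p} + h_p^\alpha\,(\delta x_\alpha)_{,q}\}$$

where $h_{pq} = g_{pq} - 2\epsilon_{pq}$, and hence

$$\delta\phi = \frac{\partial\phi}{\partial\epsilon_{\alpha\beta}}\,\delta\epsilon_{\alpha\beta} = \tfrac12\,\frac{\partial\phi}{\partial\epsilon_{\alpha\beta}}\{h_\beta^\gamma\,(\delta x_\gamma)_{,\alpha} + h_\alpha^\gamma\,(\delta x_\gamma)_{,\beta}\} = \frac{\partial\phi}{\partial\epsilon_{\alpha\beta}}\,h_\beta^\gamma\,(\delta x_\gamma)_{,\alpha}$$

since $\partial\phi/\partial\epsilon_{rs} = \partial\phi/\partial\epsilon_{sr}$. Since in an isothermal virtual displacement
$\rho\,\delta\phi = T^{\alpha\beta}\,(\delta x_\alpha)_{,\beta}$ it follows that, for an isotropic medium,

$$\Big\{T^{\alpha\beta} - \rho\,h_\gamma^\alpha\,\frac{\partial\phi}{\partial\epsilon_{\gamma\beta}}\Big\}\,(\delta x_\alpha)_{,\beta} = 0.$$

Since the virtual displacement is arbitrary the components $(\delta x_p)_{,q}$ of the space
derivative of the virtual displacement vector may be assigned arbitrary values
at any point $x^r$ and so we must have

$$T^{rs} = \rho\,h_\alpha^r\,\frac{\partial\phi}{\partial\epsilon_{\alpha s}}.$$

That these equations imply $T^{rs} = T^{sr}$ follows from the fact that $\phi$ involves
the strain components $\epsilon_{rs}$ only through the strain invariants $I_1, I_2, I_3$. For
example $I_1 = \epsilon_\alpha^\alpha = g^{\alpha\beta}\epsilon_{\alpha\beta}$ so that

$$\frac{\partial I_1}{\partial\epsilon_{rs}} = g^{rs}.$$
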

The formulae just obtained are fundamental but it is frequently more
convenient to present the stress tensor in its mixed form $T_s^r$. Since


and so our formulae appear as 


The result $dV_0/dV = \sqrt{1 - 2I_1 + 4I_2 - 8I_3}$, obtained in 1, may be written
in the equivalent form $\rho = \rho_0\sqrt{1 - 2I_1 + 4I_2 - 8I_3}$ (since the principle of
conservation of mass implies $\rho\,dV = \rho_0\,dV_0$). In the classical (infinitesimal)
theory the strain invariants $I_1, I_2, I_3$ are infinitesimal quantities of the first,
second and third orders of magnitude respectively. We may therefore write,
to a first approximation, $\rho = \rho_0$ and to a second approximation $\rho = \rho_0(1 - I_1)$.
Keeping only the first approximation our fundamental stress strain relations
reduce to

$$T^{rs} = \frac{\partial\phi'}{\partial\epsilon_{rs}}; \qquad \phi' = \rho_0\,\phi.$$

These are the basic formulae (expressing Hooke's law) of the classical theory,
$\phi'$ being the elastic energy per unit initial volume (or, what is the same thing
to the degree of approximation contemplated by the classical theory, per unit
final volume).

Even in the case of finite strains the strain invariants are relatively small.
The two cases to which we shall devote some attention in detail in the present
paper are:

(a) uniform hydrostatic pressure; here the stress and strain tensors are
scalar tensors: $T_s^r = -p\,\delta_s^r$; $\epsilon_s^r = -f\,\delta_s^r$ (where $p$ is what is commonly
called pressure) and $I_1 = -3f$; $I_2 = 3f^2$; $I_3 = -f^3$. We shall discuss a little
later the agreement of the present theory with the results of experiments by
Bridgman (10) on the compressibility of sodium under pressures ranging
from 2,000 atmospheres to 20,000 atmospheres. In these experiments the
ratio $(V_0 - V)/V_0$ varied from .030 at a pressure of 2,000 atm., to .189 at a
pressure of 20,000 atm. Since $V_0/V = \sqrt{\det(\delta_s^r - 2\epsilon_s^r)} = (1 + 2f)^{3/2}$ the
corresponding values of $f$ are .010 and .075 respectively; hence $I_1$ varied
between -.030 and -.225, $I_2$ varied between .001 and .0169 whilst $I_3$ varied
between -.000001 and -.00042.

(b) uniform linear stress (Young's modulus experiment). Here the
stress tensor is diagonal with two components zero whilst the strain tensor is
diagonal with two components equal:

$$T_x^x = T_y^y = 0; \qquad \epsilon_x^x = \epsilon_y^y = -\sigma\,\epsilon_z^z$$

where $\sigma$, Poisson's ratio, has a value < .5. From the formula giving the
volume change we have $V_0/V = (1 + 2\sigma\epsilon_z^z)\sqrt{1 - 2\epsilon_z^z}$ so that $\epsilon_z^z$ cannot
surpass the value .5. Hence $I_1 = (1 - 2\sigma)\,\epsilon_z^z$ is a small fraction even for very
large strains and $I_2 = (\sigma^2 - 2\sigma)(\epsilon_z^z)^2$, $I_3 = \sigma^2(\epsilon_z^z)^3$ are smaller.

We may, therefore, even for large strains hope to secure good agreement
with experiment by expanding $\phi(\epsilon_s^r)$ as a power series in the strain
components and neglecting terms of orders of magnitude greater than an agreed
upon order (say the second, third, etc.). In the classical, infinitesimal, theory
the agreed upon order is the second. We shall agree upon the third but we
call explicit attention at this point to the fact that our theory gives remarkable
agreement with experimental results even if we do not introduce any more
constants (i.e. coefficients in the expansion of the elastic energy density $\phi$)
than those (two in number) introduced in the infinitesimal theory. In fact
in the case of the compressibility experiments the two constants combine into
a single one so that a one constant formula suffices to predict to a high degree
of accuracy the connection between pressure and volume over the extensive
range from 2,000 to 20,000 atmospheres. It is clear that an additive constant
in $\phi$ is of no significance since $\phi$ enters our fundamental equations only through
its partial derivatives. In order to keep as closely as possible to the notations
of the classical (infinitesimal) theory we shall expand $\rho_0\phi$ instead of $\phi$:

$$\rho_0\phi = \alpha I_1 + \frac{\lambda + 2\mu}{2}\,I_1^2 - 2\mu I_2 + l\,I_1^3 + m\,I_1 I_2 + n\,I_3.$$

As we shall see in a moment the hypothesis that the stress is zero in the initial
state (characterised by $\epsilon_s^r = 0$) forces $\alpha = 0$ and the assumption of the
infinitesimal theory is

$$\rho_0\phi = \frac{\lambda + 2\mu}{2}\,I_1^2 - 2\mu I_2$$

(where, of course, the invariants $I_1$, $I_2$ of the classical theory are the
approximations obtained by neglecting the second order terms in the expressions for the
strain components). In the usual presentations of the infinitesimal theory the
invariant $I_2' = \epsilon_\beta^\alpha\epsilon_\alpha^\beta = I_1^2 - 2I_2$ is used instead of our $I_2$ and the expression
for $\rho_0\phi$ appears as

$$\rho_0\phi = \frac{\lambda}{2}\,I_1^2 + \mu I_2'.$$

On using the relations

$$\frac{\partial I_1}{\partial\epsilon_{rs}} = g^{rs}; \qquad \frac{\partial I_2}{\partial\epsilon_{rs}} = I_1\,g^{rs} - \epsilon^{rs}; \qquad \frac{\partial I_3}{\partial\epsilon_{rs}} = I_3\,\xi^{rs}$$

where $\xi_s^r$ is the tensor reciprocal to $\epsilon_s^r$:

$$\xi_\alpha^r\,\epsilon_s^\alpha = \delta_s^r$$

and remembering that $\rho/\rho_0 = \sqrt{1 - 2I_1 + 4I_2 - 8I_3}$ we find

$$\frac{\rho_0}{\rho}\,T_s^r = (\alpha + \lambda I_1 + (3l + m)I_1^2 + mI_2 - 2nI_3)\,\delta_s^r$$
$$\qquad + [2(\mu - \alpha) - (m + 2\lambda)I_1 - 2(3l + m)I_1^2 - 2mI_2]\,\epsilon_s^r$$
$$\qquad + 2(mI_1 - 2\mu)\,\epsilon_\alpha^r\epsilon_s^\alpha + nI_3\,\xi_s^r.$$

The value of $T_s^r$ in the initial position ($\epsilon_s^r = 0$) is, accordingly

$$(T_s^r)_0 = \alpha\,\delta_s^r$$

(so that the assumption of isotropy forces the stress in the initial position to
be scalar i.e. of the nature of a hydrostatic pressure); we shall make the usual
assumption that the stress in the initial position is zero forcing $\alpha = 0$. On
multiplying out by the factor $1 - I_1 + \cdots$ and neglecting quantities of
higher order than the second we find

$$T_s^r = \{\lambda I_1 + (3l + m - \lambda)I_1^2 + mI_2\}\,\delta_s^r + \{2\mu - (m + 2\lambda + 2\mu)I_1\}\,\epsilon_s^r - 4\mu\,\epsilon_\alpha^r\epsilon_s^\alpha + nI_3\,\xi_s^r.$$

We shall not keep in the following the quantities arising from the third order
terms in the expansion of $\rho_0\phi$ i.e. we shall set $l = 0$, $m = 0$, $n = 0$ when
we obtain

$$T_s^r = \lambda I_1(1 - I_1)\,\delta_s^r + 2\{\mu - (\lambda + \mu)I_1\}\,\epsilon_s^r - 4\mu\,\epsilon_\alpha^r\epsilon_s^\alpha.$$

The first invariant $T = T_\alpha^\alpha$ of the stress tensor is, accordingly,

$$T = (3\lambda + 2\mu)I_1 - (5\lambda + 6\mu)I_1^2 + 8\mu I_2.$$

If we neglect the second order terms we obtain the expressions of the
infinitesimal theory

$$T_s^r = \lambda I_1\,\delta_s^r + 2\mu\,\epsilon_s^r; \qquad T = (3\lambda + 2\mu)I_1.$$
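The passage from the two-constant stress formula to its first invariant can be checked numerically. The sketch below assumes a strain tensor already referred to its principal axes, so that only the three diagonal components enter; the function names and the sample values of the strains and of the constants are illustrative and are not taken from the paper.

```python
def stress_two_constant(principal_strains, lam, mu):
    """Principal stresses from the second-order, two-constant relation
    T = lam*I1*(1 - I1) + 2*(mu - (lam + mu)*I1)*eps - 4*mu*eps**2,
    applied to each principal strain eps (l = m = n = 0)."""
    i1 = sum(principal_strains)
    return [lam * i1 * (1.0 - i1)
            + 2.0 * (mu - (lam + mu) * i1) * e
            - 4.0 * mu * e * e
            for e in principal_strains]


def stress_trace(principal_strains, lam, mu):
    """First stress invariant T = (3*lam + 2*mu)*I1 - (5*lam + 6*mu)*I1**2 + 8*mu*I2."""
    e1, e2, e3 = principal_strains
    i1 = e1 + e2 + e3
    i2 = e1 * e2 + e2 * e3 + e3 * e1
    return (3 * lam + 2 * mu) * i1 - (5 * lam + 6 * mu) * i1 ** 2 + 8 * mu * i2


if __name__ == "__main__":
    strains = (0.010, -0.004, 0.002)   # made-up principal strains
    lam, mu = 1.5, 1.0                 # made-up elastic constants
    print(sum(stress_two_constant(strains, lam, mu)), stress_trace(strains, lam, mu))
```

The two printed values coincide, the step between them being the identity $\epsilon_\beta^\alpha\epsilon_\alpha^\beta = I_1^2 - 2I_2$.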


Although, as we shall see, a good approximation to the results of experiments
may be secured by using only two elastic constants $\lambda$, $\mu$ i.e. by neglecting the
third order terms in the expansion of $\rho_0\phi$ a true second order approximation
would keep these third order terms thus introducing five elastic constants
$\lambda$, $\mu$, $l$, $m$, $n$. Neglecting, however, third order terms in the expressions for
the stress tensor the second order approximation is

$$T_s^r = \{\lambda I_1 + (3l + m - \lambda)I_1^2 + mI_2\}\,\delta_s^r + \{2\mu - (m + 2\lambda + 2\mu)I_1\}\,\epsilon_s^r - 4\mu\,\epsilon_\alpha^r\epsilon_s^\alpha + nI_3\,\xi_s^r$$
$$T = (3\lambda + 2\mu)I_1 + (9l + 2m - 5\lambda - 6\mu)I_1^2 + (3m + n + 8\mu)I_2$$

(the invariant $\xi_\alpha^\alpha$ of the reciprocal of the strain tensor $\epsilon_s^r$ being $I_2/I_3$).
In the treatment by Seth (7) of the problem of finite strain the strain
components were not truncated, as in the classical infinitesimal theory, by the
omission of the second order terms, but the equations

$$T_s^r = \lambda I_1\,\delta_s^r + 2\mu\,\epsilon_s^r; \qquad T = (3\lambda + 2\mu)I_1$$

of the infinitesimal theory were taken over with the following explanatory
remark "Since this is the simplest tensor form that we can take, it is quite
natural for us to assume that the stress-strain relations are governed by
equations of the above type." From the discussion given above it is clear that
simplicity is not a sufficiently compelling reason; for the whole strength
obtained by a willingness to keep the second order terms in the strain
components is sacrificed by the omission of second order terms (such as those
that occur in the terms $-\lambda I_1^2\,\delta_s^r$ etc.) in the expressions for the stress
components.


4. The case of hydrostatic pressure; comparison of theory with
experiment. In this simplest case the strain tensor is scalar $\epsilon_s^r = \epsilon\,\delta_s^r$ and
$2\epsilon = 1 - (V_0/V)^{2/3}$. In most cases the stress is a pressure rather than a
tension and $\epsilon$ is negative and so we write $T_s^r = -p\,\delta_s^r$; $\epsilon = -f$. Then
$I_1 = -3f$; $I_2 = 3f^2$; $I_3 = -f^3$, and the formula connecting pressure with
change of volume is

$$p = af + bf^2; \qquad f = \tfrac12\{(V_0/V)^{2/3} - 1\}$$
$$a = 3\lambda + 2\mu; \qquad b = 15\lambda + 10\mu - 27l - 9m - n.$$

Observe that if we avail ourselves only of the two elastic constants $\lambda$, $\mu$ of the
classical (infinitesimal) theory $b = 15\lambda + 10\mu = 5a$ so that our formula is a
one constant one

$$p = a(f + 5f^2); \qquad a = 3\lambda + 2\mu.$$




For small compressions

$$f = \tfrac12\{(V_0/V)^{2/3} - 1\} = \tfrac12\{(1 - \Delta V/V_0)^{-2/3} - 1\} = \tfrac13\,\Delta V/V_0 + \cdots$$

and so the modulus of compression: $p \div (\Delta V/V_0) = p \div 3f$; hence the modulus
of compression is $a/3$. The following table gives a comparison of the theory
with the experimental results of Bridgman (10) upon the compressibility of
sodium. The quadratic formula $p = af + bf^2$ was used and the two constants
at our disposal were determined by the experimental results at 2,000 atm. (the
beginning of the experiment) and at 12,000 atm. (near the middle of the
experimental range which ran from 2,000 to 20,000 atm., at intervals of
2,000 atm.). In this way the availability of the formula for purposes of
extrapolation (12,000 to 20,000 atm.), as well as interpolation (2,000 to
12,000 atm.), was tested.


TABLE 1.

      p      ΔV/V_0       f      p (calculated)
    2000     .0295      .0101        -
    4000     .0552      .0193       4005
    6000     .0779      .0278       6022
    8000      ...       .0356       8005
   10000     .1165       ...       10003
   12000     .1332      .0500        -
   14000      ...       .0567      14008
   16000      ...       .0631      16014
   18000     .1767      .0692      18006
   20000      ...       .0751      20007


$a = 1.874 \times 10^5$; $b = 1.052 \times 10^6$.


The largest discrepancy is that corresponding to a measured pressure of 6,000
atm., and a calculated pressure of 6,022 atm., an error of about one-third of
1%. The other calculated values are correct to within one-tenth of 1%, most
being much closer. Attention should be called to the fact that $b = 5.6\,a$,
so that a neglect of the third order terms in the expansion of $\rho_0\phi$ (which
neglect would force $b$ to be $5a$) would only disturb the agreement to the
extent $.6\,af^2$. Thus the one constant formula $p = a(f + 5f^2)$ fits the data for
sodium to an accuracy of within 1.5% over the range 2,000 atm., to 20,000 atm.,
the constant $a$ having the value $1.92 \times 10^5$ (determined by the measurement
at 12,000 atm.). The agreement is as good as could be expected since $f$ is only
measured to an accuracy varying between .5% at 2,000 atm., to .07% at
20,000 atm.
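The calculated column of Table 1 can be reproduced with a few lines of code. This is a minimal sketch, assuming the constants are $a = 1.874 \times 10^5$ and $b = 1.052 \times 10^6$ (the powers of ten are an inference from the tabulated values, since they reproduce the fitted pressures at 2,000 and 12,000 atm.); only the legible $(p, f)$ pairs of the table are used, and all identifiers are illustrative.

```python
A = 1.874e5   # assumed value of a, in atmospheres
B = 1.052e6   # assumed value of b, in atmospheres

# (observed pressure in atm., f) for the rows of Table 1 whose f is legible.
TABLE_1 = [
    (2000, 0.0101), (4000, 0.0193), (6000, 0.0278), (8000, 0.0356),
    (12000, 0.0500), (14000, 0.0567), (16000, 0.0631), (18000, 0.0692),
    (20000, 0.0751),
]


def pressure_quadratic(f, a=A, b=B):
    """Two-constant formula p = a*f + b*f**2."""
    return a * f + b * f * f


def pressure_one_constant(f, a):
    """One-constant formula p = a*(f + 5*f**2), i.e. b = 5a."""
    return a * (f + 5.0 * f * f)


if __name__ == "__main__":
    for p_obs, f in TABLE_1:
        print(f"{p_obs:6d}  f={f:.4f}  p_calc={pressure_quadratic(f):8.1f}")
```

Because $f$ is tabulated to four decimals only, the recomputed pressures can differ from the printed ones by a few atmospheres (for instance at 4,000 atm.).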

A two constant formula does not give very good agreement with
experimental results for liquids which are much more compressible than solids.
However a three constant formula (which would result if the energy-density
were expanded as far as fourth order terms) gives very good agreement. The
following table gives the result of a comparison between the results of
calculation from a three-constant formula $p = af + bf^2 + cf^3$ and the experimental
results of Bridgman (11) on the compressibility of N-Amyl Iodide, at 0°
temperature, under pressures varying between 500 atm., and 12,000 atm. Over
this range the pressures calculated agreed with those measured to within less
than one per cent.


TABLE 2.
N-Amyl Iodide (0°)

  p (obs)   V/V_0      f        af      bf^2     cf^3    p (calc.)
     500    .9685    .0108    444.9     49.4     10.2       -
    1000    .9442    .0195    803.9    161.7     37.2     1002.8
    1500    .9250    .0267    1099     302.3    134.7     1496.4
    2000    .9094    .0327    1347     453.7    174.9     1975.6
    3000    .8831    .0432    1780     792.9    404.2     2977.1
    4000    .8624    .0518    2136     1142     698.7     3976.7
    5000    .8451    .0593    2445     1496     1048      4989
    6000    .8304    .0659    2717     1848     1438        -
    7000    .8173    .0720    2967     2202     1871      7040
    8000    .8064    .0771    3177     2525     2297      7999
    9000    .7965    .0819    3375     2849     2753      8977
   10000    .7873    .0864    3560     3171     3233      9964
   11000    .7786    .0908    3741     3502     3752     10995
   12000    .7706    .0948    3908     3822     4277        -


$a = 412.04 \times 10^2$; $b = 424.8 \times 10^3$; $c = 501.2 \times 10^4$.


In order to show the dependence of the coefficients $a$, $b$, $c$ upon the temperature
similar calculations were made from measurements at 50° C with the following
result

$$a_{50} = 303.94 \times 10^2; \qquad b_{50} = 316.26 \times 10^3; \qquad c_{50} = 365.51 \times 10^4.$$


A similar three constant formula for sodium over the range 2,000 to 20,000 
atmospheres (the constants being determined by the measurements at 2,000, 
10,000 and 20,000 atmospheres) gave the values 


a= 188.13 X c==37.2 
and the correspondence between the observed and calculated values was: 


p (obs) 4000, 6000, 8000, 12000, 14000, 16000, 18000 
p(calc) 4008, 6014, 8003, 11979, 13984, 15976, 17984 


an agreement to within one quarter of one per cent over the entire range. 


5. The Young's modulus experiment. In this experiment the ends of
a cylinder of length $l$ are subjected to a uniform stress $T$, the sides being free
from any applied force. The conditions of the problem are met by assuming
$u = px$; $v = py$; $w = rz$ (where the z-axis is parallel to the generators of the
cylinder) it being agreed that mass forces, such as the weight of the mass
elements of the cylinder, may be neglected. The strain tensor is diagonal
with diagonal elements

$$\epsilon_1^1 = \epsilon_2^2 = p - \tfrac{1}{2}p^2; \qquad \epsilon_3^3 = r - \tfrac{1}{2}r^2$$

and the stress tensor is consequently also diagonal. The numbers $p$, $r$ must be
such that $T_1^1 = T_2^2 = 0$; $T_3^3 = T$. If $e = w/c$ is the relative extension we
have, since $w = rz$, $e = r(1 + e)$ so that $r = e - e^2 + \cdots$ implying

$$\epsilon_3^3 = e - \tfrac{3}{2}e^2 + \cdots .$$

For simplicity of notation we write $\epsilon$ for $\epsilon_3^3$ and set $\epsilon_1^1 = \epsilon_2^2 = -\sigma\epsilon_3^3$. If
$f = -u/a = -v/b$ denotes, for the moment, the relative contraction, in a
direction perpendicular to the applied stress, we have

$$\sigma\epsilon = \frac{f(2 - f)}{2(1 - f)^2} = f + \tfrac{3}{2}f^2 + \cdots .$$

To a first approximation $\sigma = f/\epsilon$ measures the ratio of the relative contraction,
perpendicular to the applied stress, to the relative extension in the direction
of the applied stress; we shall refer to $\sigma$ as Poisson's ratio.

Since $T_1^1 = T_2^2 = 0$ we have $T = T_3^3 = T_\alpha^\alpha$ and so

$$T = (3\lambda + 2\mu)I_1 + (9l + 2m - 5\lambda - 6\mu)I_1^2 + (3m + n + 8\mu)I_2.$$


We shall content ourselves with examining what partial explanation of the
phenomena observed in the Young's modulus experiment may be obtained by
using only the two elastic constants $\lambda$, $\mu$ of the infinitesimal theory. On setting
$l = 0$, $m = 0$, $n = 0$ we find

$$T = (3\lambda + 2\mu)I_1 - (5\lambda + 6\mu)I_1^2 + 8\mu I_2.$$

On putting in the values $I_1 = (1 - 2\sigma)\epsilon$, $I_2 = (\sigma^2 - 2\sigma)\epsilon^2$ we find

$$T = (3\lambda + 2\mu)(1 - 2\sigma)\epsilon - \{(5\lambda + 6\mu)(1 - 2\sigma)^2 + 8\mu(2\sigma - \sigma^2)\}\epsilon^2.$$
The relation $T_1^1 = 0$ yields

$$\lambda - 2\sigma(\lambda + \mu) - \{8(\lambda + \mu)\sigma^2 - 2(3\lambda + \mu)\sigma + \lambda\}\epsilon = 0$$

or

$$\{\lambda - 2\sigma(\lambda + \mu)\}\{1 + (4\sigma - 1)\epsilon\} = 0.$$

Since $V_0/V = (1 + 2\sigma\epsilon)\sqrt{1 - 2\epsilon}$ the maximum value of $\epsilon$ is .5 and so, granting
$\sigma > 0$, $1 + (4\sigma - 1)\epsilon > .5$ so that $\sigma = \lambda/2(\lambda + \mu)$. It is important to notice
that this constancy of Poisson's ratio $\sigma$ is not a mere approximation but an
exact result. On inserting this value of $\sigma$ in the expression for $T$ we find

$$T = E\epsilon\left\{1 - \frac{2\lambda + 3\mu}{\lambda + \mu}\,\epsilon\right\}$$

where $E = \dfrac{\mu(3\lambda + 2\mu)}{\lambda + \mu}$ is Young's modulus. The first approximation, which
would be furnished by the infinitesimal theory, is $T = E\epsilon$ but the significance
of the quadratic formula of the finite theory is that the graph of $T$ against $\epsilon$
is a parabola instead of a straight line. Hence $T$ has a maximum value

$$(T)_{\max} = \frac{\lambda + \mu}{4(2\lambda + 3\mu)}\,E \quad \text{occurring when} \quad \epsilon = \frac{\lambda + \mu}{2(2\lambda + 3\mu)}.$$

What this means is that if a larger stress is applied the deformation cannot be
of the simple type described by the formulae $u = px$; $v = py$; $w = rz$. For steel
$\lambda$ is approximately $1.5\mu$ so that $(T)_{\max} = E/10$, the corresponding value of
$\epsilon$ being .2. In using the formula given above for $T$ it should be noticed that

$$\epsilon = r - \tfrac{1}{2}r^2 = \frac{e(2 + e)}{2(1 + e)^2}$$

where $e$ is the relative extension. These results of the finite theory, predicting
a maximum stress (yield point), are qualitatively correct but the predicted
value of $(T)_{\max}$ is much too large.
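The two-constant stress law of this section can be checked numerically. The sketch below is only an illustration under stated assumptions: it takes the formulas T = (3λ+2μ)I₁ − (5λ+6μ)I₁² + 8μI₂, σ = λ/2(λ+μ) and T = Eε{1 − (2λ+3μ)ε/(λ+μ)} as read from the text, uses μ as the unit of stress, and all function names are hypothetical.

```python
def youngs_modulus(lam, mu):
    """E = mu (3 lam + 2 mu) / (lam + mu)."""
    return mu * (3 * lam + 2 * mu) / (lam + mu)


def poisson_ratio(lam, mu):
    """sigma = lam / (2 (lam + mu))."""
    return lam / (2 * (lam + mu))


def stress_from_invariants(eps, lam, mu):
    """T from the invariants I1 = (1 - 2 sigma) eps, I2 = (sigma^2 - 2 sigma) eps^2."""
    s = poisson_ratio(lam, mu)
    i1 = (1 - 2 * s) * eps
    i2 = (s * s - 2 * s) * eps * eps
    return (3 * lam + 2 * mu) * i1 - (5 * lam + 6 * mu) * i1**2 + 8 * mu * i2


def stress(eps, lam, mu):
    """Closed form T = E eps (1 - k eps) with k = (2 lam + 3 mu)/(lam + mu)."""
    k = (2 * lam + 3 * mu) / (lam + mu)
    return youngs_modulus(lam, mu) * eps * (1 - k * eps)


def stress_maximum(lam, mu):
    """Vertex of the parabola: (eps at the maximum, maximum stress)."""
    eps_max = (lam + mu) / (2 * (2 * lam + 3 * mu))
    return eps_max, stress(eps_max, lam, mu)


lam, mu = 1.5, 1.0  # steel-like ratio lam = 1.5 mu, with mu as unit of stress
eps_max, t_max = stress_maximum(lam, mu)
print(eps_max, t_max / youngs_modulus(lam, mu))
```

For λ = 1.5μ the vertex lies near ε = .21 with a maximum stress close to E/10, in line with the figures quoted for steel.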


6. The stress-strain equations for a non-isotropic medium. The elastic 
potential is now a function $(pqy, pqc, J’) of the scalar strain components pq‘ 
the coefficients pgc of the quadratic form for (ds,)? and the temperature 7’. 
On using the expression for 8 pq given in 1, namely 




28 nan = { (8%a),¢ + (8x8),a} (¢,2*) 


and agreeing that @ is symmetrized with respect to the scalar strain com- 
ponents so that 06/0(pqn) = 06/0( we find at once 


These expressions give, from the Lagrangian viewpoint, the stress tensor in 
terms of the gradient of the elastic potential relative to the scalar strain com- 
ponents »gy. In order to obtain the corresponding Eulerian equations we intro- 
duce the matrix j reciprocal to the matrix k whose elements 


= Jap(p,%*) (¢,0°) = 2 pan + mal; 
it being clear that the elements of j are given by the formula 


On taking the variation of the matrix equation jk =e (the unit matrix) we 
find 8j-k + j:8k—0 or equivalently, 6; —=—j-d5k-j. Hence 


a 
ap 
—— 2 ( 7) 8(orn) (783) 
so tha 
Since 


(?*7) = (Pa) = 
it follows that 


0 
= — 2p (ta) (Fa). 


Appendix. 


Expression of the quantities $J_1, J_2, J_3$ in terms of the invariants $I_1, I_2, I_3$.
The invariants $s_1, s_2, s_3$ of any matrix $u$ are determined by the equation

$$\det(\lambda\delta_s^r - u_s^r) = \lambda^3 - s_1\lambda^2 + s_2\lambda - s_3.$$

It is immediately clear that if $u$, $v$ are any two non-singular $3 \times 3$ matrices
the two products $uv$ and $vu$ have the same invariants. In fact if $E = (\delta_s^r)$
is the three-rowed unit matrix

$$\det(\lambda E - vu) = \det u^{-1}(\lambda E - uv)u = \det(\lambda E - uv).$$

Secondly if $u$ is any non-singular three-rowed matrix with invariants $s_1, s_2, s_3$
the invariants $\sigma_1, \sigma_2, \sigma_3$ of its reciprocal $u^{-1}$ are given by the formulae
$\sigma_1 = s_2/s_3$; $\sigma_2 = s_1/s_3$; $\sigma_3 = 1/s_3$. For

$$\det(\lambda E - u^{-1}) = -\lambda^3(\det u^{-1})\det(\lambda^{-1}E - u) = \frac{1}{s_3}\{\lambda^3 s_3 - \lambda^2 s_2 + \lambda s_1 - 1\}$$

since $\det u = s_3$.
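Both statements about invariants can be verified on a concrete pair of matrices. The sketch below uses plain Python lists; the sample matrices `U`, `V` and all helper names are arbitrary choices made for illustration.

```python
def matmul(u, v):
    """Product of two 3x3 matrices given as nested lists."""
    return [[sum(u[i][k] * v[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def det3(m):
    """Determinant of a 3x3 matrix by expansion along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def invariants(m):
    """(s1, s2, s3): trace, sum of principal 2x2 minors, determinant."""
    s1 = m[0][0] + m[1][1] + m[2][2]
    s2 = (m[0][0] * m[1][1] - m[0][1] * m[1][0]
          + m[0][0] * m[2][2] - m[0][2] * m[2][0]
          + m[1][1] * m[2][2] - m[1][2] * m[2][1])
    return s1, s2, det3(m)


def inverse(m):
    """Reciprocal matrix via the adjugate (transpose of the cofactor matrix)."""
    d = det3(m)

    def cofactor(i, j):
        rows = [r for r in range(3) if r != i]
        cols = [c for c in range(3) if c != j]
        minor = (m[rows[0]][cols[0]] * m[rows[1]][cols[1]]
                 - m[rows[0]][cols[1]] * m[rows[1]][cols[0]])
        return (-1) ** (i + j) * minor

    return [[cofactor(j, i) / d for j in range(3)] for i in range(3)]


U = [[2.0, 1.0, 0.0], [0.0, 1.0, 3.0], [1.0, 0.0, 1.0]]  # det = 5
V = [[1.0, 2.0, 0.0], [0.0, 1.0, 1.0], [1.0, 0.0, 2.0]]  # det = 4

print(invariants(matmul(U, V)), invariants(matmul(V, U)))
print(invariants(inverse(U)))
```

The first line prints the same triple twice; the second reproduces (s₂/s₃, s₁/s₃, 1/s₃) for the sample matrix `U`.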

Now if we denote by g the matrix whose elements are gpg and by ¢ the 
matrix whose elements are .,2” (the upper label denoting the row and the lower 
the column) the matrix whose elements are pgk = gag(»,*) (¢,7°) is t’gt where 
the prime attached to a matrix denotes, as usual, its transposed. Hence the 
matrix k whose elements are 


is c't’gt (c being the matrix whose elements are pgc). Similarly the matrix 
whose elements are hpq = (faq) is since the matrix whose 
elements are 7@,, is the reciprocal of the matrix whose elements are 57". Hence 
the matrix h whose elements are hg? = — is Hence its 
reciprocal h- is given by h —tce"t’g and since k —c"#’gt it follows that 
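$k$ and $h^{-1}$ have the same invariants. But the invariants of $k$ are the coefficients
of $-\lambda^2$, $\lambda$, $-1$, respectively, in the development of

$$\det(\lambda E - k) = \det[(\lambda - 1)E - 2\eta] = 8\det(\nu E - \eta), \qquad \nu = \tfrac{1}{2}(\lambda - 1),$$
$$= (\lambda - 1)^3 - 2(\lambda - 1)^2 J_1 + 4(\lambda - 1)J_2 - 8J_3.$$

Hence the invariants of $k$ are, respectively,
$2J_1 + 3$, $4J_2 + 4J_1 + 3$, $8J_3 + 4J_2 + 2J_1 + 1$.
Similarly the invariants of $h$ are, respectively,

$$3 - 2I_1, \qquad 3 - 4I_1 + 4I_2, \qquad 1 - 2I_1 + 4I_2 - 8I_3$$

so that the invariants of $h^{-1}$ are, respectively,

$$\frac{3 - 4I_1 + 4I_2}{1 - 2I_1 + 4I_2 - 8I_3}, \qquad \frac{3 - 2I_1}{1 - 2I_1 + 4I_2 - 8I_3}, \qquad \frac{1}{1 - 2I_1 + 4I_2 - 8I_3}.$$

Hence we have the equations

$$2J_1 + 3 = \frac{3 - 4I_1 + 4I_2}{1 - 2I_1 + 4I_2 - 8I_3}; \quad 4J_2 + 4J_1 + 3 = \frac{3 - 2I_1}{1 - 2I_1 + 4I_2 - 8I_3}; \quad 8J_3 + 4J_2 + 2J_1 + 1 = \frac{1}{1 - 2I_1 + 4I_2 - 8I_3}$$

and solving these we find

$$J_1 = \frac{I_1 - 4I_2 + 12I_3}{1 - 2I_1 + 4I_2 - 8I_3}; \quad J_2 = \frac{I_2 - 6I_3}{1 - 2I_1 + 4I_2 - 8I_3}; \quad J_3 = \frac{I_3}{1 - 2I_1 + 4I_2 - 8I_3}$$

or, equivalently,

$$I_1 = \frac{J_1 + 4J_2 + 12J_3}{1 + 2J_1 + 4J_2 + 8J_3}; \quad I_2 = \frac{J_2 + 6J_3}{1 + 2J_1 + 4J_2 + 8J_3}; \quad I_3 = \frac{J_3}{1 + 2J_1 + 4J_2 + 8J_3}.$$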
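The relations between the two sets of invariants can be tested on principal values. The sketch below is an illustration under an explicit assumption: since k and h⁻¹ have the same invariants, the principal values are taken to satisfy 1 + 2η = 1/(1 − 2ε), and the formulas J₁ = (I₁ − 4I₂ + 12I₃)/D, J₂ = (I₂ − 6I₃)/D, J₃ = I₃/D with D = 1 − 2I₁ + 4I₂ − 8I₃ are the ones obtained by solving the three equations above. Names and sample strains are hypothetical.

```python
def elementary_invariants(x1, x2, x3):
    """Sum, sum of pairwise products, and product of three principal values."""
    return x1 + x2 + x3, x1 * x2 + x2 * x3 + x3 * x1, x1 * x2 * x3


def j_from_i(i1, i2, i3):
    """J1, J2, J3 in terms of I1, I2, I3 (formulas as derived above)."""
    d = 1 - 2 * i1 + 4 * i2 - 8 * i3
    return (i1 - 4 * i2 + 12 * i3) / d, (i2 - 6 * i3) / d, i3 / d


# ASSUMPTION: arbitrary small principal values of the strain entering I1, I2, I3.
eps = (0.05, -0.02, 0.01)
# Principal values of the other strain from 1 + 2*eta = 1 / (1 - 2*eps).
eta = tuple(0.5 * (1.0 / (1.0 - 2.0 * e) - 1.0) for e in eps)

direct = elementary_invariants(*eta)
via_formula = j_from_i(*elementary_invariants(*eps))
print(direct)
print(via_formula)
```

The two printed triples agree to rounding error, which is a quick consistency check on the algebra.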


Summary. 


In the present paper formulae are derived which enable one to calculate
the stress in an elastic medium when the strain and the elastic energy density
are known, no simplifying assumptions, such as smallness of strain, being
necessary. For an isotropic elastic solid under hydrostatic pressure the following
one constant formula gives good agreement with experimental observation
(only two elastic constants $\lambda$, $\mu$ being used in the expression for the
elastic energy density)

$$p = a(f + 5f^2).$$

In the Young's modulus experiment the formula for the extensional stress
(again using only the two constants $\lambda$, $\mu$) is

$$T = \left\{1 - \frac{2\lambda + 3\mu}{\lambda + \mu}\,\epsilon\right\}E\epsilon; \qquad E = \frac{\mu(3\lambda + 2\mu)}{\lambda + \mu} \ \text{(Young's modulus)}$$

where $\epsilon = \dfrac{e(2 + e)}{2(1 + e)^2}$, $e$ being the relative extension. Hence $T$ has a maximum
value $\dfrac{\lambda + \mu}{4(2\lambda + 3\mu)}\,E$, occurring when $\epsilon = \dfrac{\lambda + \mu}{2(2\lambda + 3\mu)}$. For a true second
order approximation (the infinitesimal theory being regarded as a first order
approximation) five elastic constants occur and the corresponding formulae
are either given or their derivation is immediate.


THE JOHNS HOPKINS UNIVERSITY; 
INSTITUTE FOR ADVANCED STUDY. 


REFERENCES. 


1. G. Kirchhoff, "Über die Gleichungen des Gleichgewichtes eines elastischen Körpers
bei nicht unendlich kleinen Verschiebungen seiner Theile," Sitz. math-nat.
Klasse der kaiserlichen Akad. der Wiss., 9 (1852), p. 762.
2. J. Boussinesq, "Théorie des ondes liquides périodiques," Mémoires présentés à
l'Ac. des Sciences, 20 (1869), p. 516.
3. E. et F. Cosserat, "Sur la théorie de l'élasticité," Ann. de Toulouse, 10 (1896).
4. P. Duhem, "Recherches sur l'Élasticité," Ann. de l'École nor. sup., 3me Série, 21,
22, 23 (1904-06).
5. E. Almansi, "Sulle deformazioni finite dei solidi elastici isotropi," Rend. Lincei,
Ser. 5, Tome 20' (1911).
6. L. Brillouin, "Sur les tensions de radiation," Ann. de Physique, 10me Série 4 (1925).
7. B. R. Seth, "Finite strain in elastic problems," Phil. Trans. Roy. Soc., A 234 (1935),
p. 231.
8. A. Signorini, "Trasformazioni termoelastiche finite, etc.," Soc. Italiana per il
progresso delle scienze 14 (1936).
9. E. G. Coker and L. N. G. Filon, Treatise on photo-elasticity, Cambridge (1931).
10. P. W. Bridgman, "Electrical resistances and volume changes up to 20,000 Kg./cm.²,"
Proceedings of the National Academy of Sciences, 21 (1935), p. 109.
11. P. W. Bridgman, "The pressure-volume-temperature relations of fifteen liquids,"
Proc. Amer. Ac. Arts and Sc., 68 (1933).




MEAN MOTIONS AND DISTRIBUTION FUNCTIONS.* 


By PHILIP HARTMAN, E. R. VAN KAMPEN and AUREL WINTNER.


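Let $k = 1, \cdots, n$ and $\sum = \sum_{k=1}^{n}$. If $z(t)$ is a function of the form

$$z(t) = x(t) + iy(t) = \sum a_k \exp 2\pi i(\lambda_k t + \alpha_k), \tag{1}$$

where $\lambda_k$, $\alpha_k$ are real and $a_k > 0$, put

$$z(t) = \pm|z(t)| \exp 2\pi i\varphi(t), \tag{2}$$

where the sign of $\pm|z(t)|$ is to be chosen for every $t$ in such a way that
$\varphi = \varphi(t)$ becomes a continuous function of $t$. The function $\varphi(t)$ is said to
have for $t \to +\infty$ a mean motion $\mu$ if

$$\varphi(t)/t \to \mu, \ \text{i.e.,} \ \varphi(t) = \mu t + o(t); \qquad t \to +\infty. \tag{3}$$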


The problem of the existence and the determination of this constant $\mu$ goes
back to Lagrange's approximative treatment of secular perturbations of the
major planets and has been solved in the case $n = 3$ by Bohl.² The case $n = 4$
has been treated by Weyl.³ The present note attempts a general approach to
the problem from the point of view of the theory of distribution functions.

The connection between mean motions and asymptotic distribution functions
is suggested by the following consideration: It is known⁴ that if the real
function $\gamma = \gamma(t)$ is almost periodic in the sense of Bohr, then $\gamma(t)$ possesses
an asymptotic distribution function $\sigma(\xi)$ and one has, for every continuous
function $f = f(\xi)$,

$$\int_{-\infty}^{+\infty} f(\xi)\,d\sigma(\xi) = M\{f(\gamma(t))\}, \tag{4}$$

where

$$M\{g(t)\} = \lim_{T \to \infty} (1/T)\int_0^T g(t)\,dt. \tag{5}$$

It is understood that by the existence of an asymptotic distribution function
of a real measurable function $\gamma(t)$, where $0 \le t < +\infty$, is meant the existence
of a monotone function $\sigma(\xi)$ such that $\sigma(-\infty) = 0$, $\sigma(+\infty) = 1$ and, if $\xi$
is a continuity point of $\sigma(\xi)$,

$$(1/T)\,\text{meas}\,[\gamma(t) \le \xi]_T \to \sigma(\xi), \qquad T \to +\infty,$$

where $[\gamma(t) \le \xi]_T$ denotes the set of those points $t$ of the interval $0 \le t \le T$
at which $\gamma(t) \le \xi$.

* Received January 20, 1937.
² Bohl [3].
³ Weyl [7].
⁴ Wintner [8].

Now suppose that the amplitudes $a_k$ of (1) satisfy the
condition of Lagrange, i.e., that


$$a_{k_1} > a_{k_2} + \cdots + a_{k_n} \tag{6}$$

holds for a permutation $(k_1, \cdots, k_n)$ of $(1, \cdots, n)$. Then (3) may be
replaced by the sharper statement that

$$\varphi(t) = \mu t + \omega(t), \tag{3 bis}$$

where $\omega(t)$ and also its derivative $\omega'(t)$ are almost periodic in the sense
of Bohr. Hence, on denoting by $\gamma(t)$ the almost periodic function
$\varphi'(t) = \mu + \omega'(t)$ and by $\sigma(\xi)$ its asymptotic distribution function, (4) is
applicable, and becomes, when $f(\xi) = \xi$,

$$\int_{-\infty}^{+\infty} \xi\,d\sigma(\xi) = M\{\varphi'(t)\}, \tag{7}$$

where, according to (5) and (3),

$$M\{\varphi'(t)\} = \lim_{T \to \infty} \varphi(T)/T = \mu. \tag{8}$$

Thus

$$\int_{-\infty}^{+\infty} \xi\,d\sigma(\xi) = \mu, \tag{9}$$

so that the mean motion of $\varphi(t)$ appears as the first moment of the asymptotic
distribution function $\sigma(\xi)$ of $\varphi'(t)$.
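The definition of the mean motion can be tried out numerically in the case where Lagrange's condition holds. The sketch below is only an illustration: it follows arg z(t)/2π continuously by summing small phase increments and estimates φ(T)/T for sample amplitudes with a₁ > a₂ + a₃. In that case the dominant term keeps z(t) away from zero, and the estimate should approach the frequency λ₁ of the dominant term, a classical fact assumed here rather than taken from the text. All names and sample values are hypothetical.

```python
import cmath
import math


def angular_increment(amps, freqs, phases, t_end, steps):
    """Return phi(t_end) - phi(0), tracking arg z(t) / (2 pi) continuously."""

    def z(t):
        return sum(a * cmath.exp(2j * math.pi * (lam * t + al))
                   for a, lam, al in zip(amps, freqs, phases))

    total = 0.0
    prev = z(0.0)
    for j in range(1, steps + 1):
        cur = z(t_end * j / steps)
        # The step is small enough that each increment of arg z is below pi.
        total += cmath.phase(cur / prev)
        prev = cur
    return total / (2 * math.pi)


amps = [3.0, 1.0, 1.0]                      # a1 > a2 + a3, condition (6)
freqs = [1.0, math.sqrt(2), math.sqrt(3)]   # sample frequencies
phases = [0.1, 0.2, 0.3]                    # sample phases
T = 200.0
mu_estimate = angular_increment(amps, freqs, phases, T, 40000) / T
print(mu_estimate)
```

The deviation of φ(T) from λ₁T stays bounded here, so the estimate converges like 1/T.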

If the amplitudes $a_k$ do not satisfy the inequality (6), the preceding
considerations break down in view of the fact that $\varphi'(t)$ is not, in general, almost
periodic in the sense of Bohr, while a possible treatment of the problem within
a class of almost periodic functions more general than those of Bohr leads to
difficulties. In fact, one needs the particular case (7) of (4), and (7) is
obvious only when $\gamma$ $(= \varphi')$ is a bounded function, a condition which is not
satisfied in the majority of cases, if (6) does not hold. It is not difficult to
prove that the function $\varphi'(t)$ has a distribution function $\sigma(\xi)$ and that the
space-average represented by the Stieltjes integral on the left of (7) is
absolutely convergent. The main difficulty arises in the identification of the
space-average with the corresponding time-average $M\{\varphi'(t)\}$. It is known⁶
that the truth of Lindelöf's hypothesis in the theory of the Riemann zeta-function
depends on a question of the same type, namely on the question as
to the admissibility of the identification of certain space-averages with the
corresponding (hypothetical) time-averages, as expressed by (4).

⁵ Wintner [9].

It will be assumed that the frequencies $\lambda_k$ of (1) are linearly independent.
This assumption, as will be seen from the proof, does not essentially affect the
validity of the method. After proving the existence of the asymptotic
distribution function $\sigma(\xi)$ and of its first moment, the identification of the
time-average with the space-average remains to be treated. The admissibility of
this identification will be proved with the help of Birkhoff's ergodic theorem,⁷
by excluding, for fixed values of the amplitudes $a_k$ and the frequencies $\lambda_k$, a set
of measure zero in the n-dimensional space of the phases $\alpha_k$. Actually, there
are some indications that Birkhoff's zero set is empty in the present case. The
assumption of the linear independence of the frequencies $\lambda_k$ is to the effect⁸
that the problem is of the metrically transitive type, so that the mean motion $\mu$
will depend on the amplitudes $a_k$ and the frequencies $\lambda_k$ but not on the phases $\alpha_k$.

⁶ Cf., on the one hand, Hardy and Littlewood [4] and, on the other hand, Jessen
and Wintner [5], Theorem 31.
⁷ Birkhoff [1]; cf. also Khintchine [6].
⁸ Cf. Birkhoff [2], p. 371.

The explicit evaluation of $\mu$ will be reduced for every $n$ to the evaluation
of a definite integral in the $\theta$-space. On comparing the results of the present
paper with those of Bohl² ($n = 3$), it follows that if $a_1$, $a_2$, $a_3$ are positive,
then the integral

$$\int_0^1\!\!\int_0^1\!\!\int_0^1 \frac{a_1^2 + a_1a_2\cos 2\pi(\theta_1 - \theta_2) + a_1a_3\cos 2\pi(\theta_1 - \theta_3)}{a_1^2 + a_2^2 + a_3^2 + 2a_1a_2\cos 2\pi(\theta_1 - \theta_2) + 2a_1a_3\cos 2\pi(\theta_1 - \theta_3) + 2a_2a_3\cos 2\pi(\theta_2 - \theta_3)}\,d\theta_1\,d\theta_2\,d\theta_3$$

$$= \int_0^1\!\!\int_0^1 \frac{a_1^2 + a_1a_2\cos 2\pi\theta_2 + a_1a_3\cos 2\pi\theta_3}{a_1^2 + a_2^2 + a_3^2 + 2a_1a_2\cos 2\pi\theta_2 + 2a_1a_3\cos 2\pi\theta_3 + 2a_2a_3\cos 2\pi(\theta_2 - \theta_3)}\,d\theta_2\,d\theta_3$$

is equal to

$$\frac{1}{\pi}\arccos\frac{a_2^2 + a_3^2 - a_1^2}{2a_2a_3}$$

if $a_i \le a_j + a_k$ for all permutations $(i, j, k)$ of $(1, 2, 3)$, while it is equal to 1
if $a_1 > a_2 + a_3$, and finally it is equal to 0 if either $a_2 > a_1 + a_3$ or $a_3 > a_1 + a_2$.
Similar relations follow by comparison of the results of the present paper with
those of Weyl³ ($n = 4$). These definite integrals show with varying $a_k$
"discontinuities" of the same type as the well-known "discontinuous" integrals
of Sonine ($n = 3$) and Nicholson ($n = 4$), occurring in connection with Lord
Rayleigh's random walk problem and presented on pp. 411, 414 and 420 of
Watson's Treatise on Bessel functions.
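The three cases of the integral for n = 3 are easy to tabulate. The sketch below is a minimal illustration, assuming the closed form (1/π) arccos((a₂² + a₃² − a₁²)/(2a₂a₃)) in the triangle case and the values 1 and 0 in the two remaining cases. Since the integrand is linear in the frequencies, the mean motion is taken here to be the weighted sum of λ₁, λ₂, λ₃ with the three corresponding integrals as weights; this is an assumption of the sketch, as are all names.

```python
import math


def bohl_weight(a1, a2, a3):
    """Value of the n = 3 integral attached to the amplitude a1."""
    if a1 > a2 + a3:
        return 1.0
    if a2 > a1 + a3 or a3 > a1 + a2:
        return 0.0
    c = (a2 * a2 + a3 * a3 - a1 * a1) / (2 * a2 * a3)
    # Clamp against rounding before taking the arc cosine.
    return math.acos(max(-1.0, min(1.0, c))) / math.pi


def mean_motion(amps, freqs):
    """Weighted sum of the frequencies (assumed linearity in the frequencies)."""
    a1, a2, a3 = amps
    weights = (bohl_weight(a1, a2, a3),
               bohl_weight(a2, a3, a1),
               bohl_weight(a3, a1, a2))
    return sum(lam * w for lam, w in zip(freqs, weights))


print([bohl_weight(3, 4, 5), bohl_weight(4, 5, 3), bohl_weight(5, 3, 4)])
print(mean_motion([3.0, 1.0, 1.0], [1.0, math.sqrt(2), math.sqrt(3)]))
```

In the triangle case the weights are the angles of the triangle with sides a₁, a₂, a₃ divided by π, so they add up to 1.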

For a given function (1), put

$$\gamma(t) = \frac{1}{2\pi}\,\frac{x(t)y'(t) - x'(t)y(t)}{x(t)^2 + y(t)^2}, \tag{10}$$

where the prime denotes differentiation with respect to $t$. It is clear that, no
matter which locally continuous determination is chosen for

$$\arg z(t) = -i\log[z(t)/|z(t)|], \quad \text{if } z(t) \ne 0,$$

one has

$$[\arg z(t)]' = 2\pi\gamma(t), \quad \text{if } z(t) \ne 0.$$

It is known² that a unique function $\varphi(t)$ is defined by the following
requirements:

(i) $\varphi(t)$ is continuous for $-\infty < t < +\infty$;
(ii) $\varphi'(t) = \gamma(t)$, if $z(t) \ne 0$;
(iii) $2\pi\varphi(t) \equiv \arg z(t) \pmod{\pi}$, if $z(t) \ne 0$;
(iv) $0 \le \varphi(0) < \tfrac{1}{2}$.

The function $\varphi(t)$ thus defined satisfies (2), where one has to choose, for
every $t$, the sign + or the sign − according as the difference

$$2\pi\varphi(t) - \arg z(t)$$

is an even or an odd multiple of $\pi$, while there is no ambiguity in the case
$z(t) = 0$. The function $\varphi(t)$ will be referred to as the angular function
belonging to (1). The values of $t$ for which $z(t) = 0$, i.e., for which
$\varphi'(t) = \gamma(t)$ is undefined, clearly do not have a cluster point.

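Let $\chi = \chi(\theta_1, \cdots, \theta_n)$ denote the function

$$\chi = \frac{(\sum \lambda_k a_k\cos 2\pi\theta_k)(\sum a_k\cos 2\pi\theta_k) + (\sum \lambda_k a_k\sin 2\pi\theta_k)(\sum a_k\sin 2\pi\theta_k)}{(\sum a_k\cos 2\pi\theta_k)^2 + (\sum a_k\sin 2\pi\theta_k)^2} \tag{11}$$

defined on the n-dimensional torus

$$\Theta: \quad 0 \le \theta_k < 1; \qquad (k = 1, \cdots, n), \tag{12}$$

except on the set $N$ of those points of $\Theta$ at which the denominator of (11)
vanishes, so that $N$ is the set on $\Theta$ defined by the pair of equations

$$N: \quad F \equiv \sum a_k\cos 2\pi\theta_k = 0, \qquad G \equiv \sum a_k\sin 2\pi\theta_k = 0. \tag{13}$$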
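The function on the torus can be compared with the derivative of the angular function along the curve θₖ = λₖt + αₖ. The sketch below is an illustration only: it assumes that the function on the torus is the ratio whose numerator is (Σλₖaₖ cos 2πθₖ)(Σaₖ cos 2πθₖ) + (Σλₖaₖ sin 2πθₖ)(Σaₖ sin 2πθₖ) and whose denominator is |Σaₖ exp 2πiθₖ|², and it approximates φ′(t) by a central difference of arg z(t)/2π. Sample values and names are hypothetical.

```python
import cmath
import math


def chi(theta, amps, freqs):
    """The function on the torus, evaluated at the point theta."""
    c = sum(a * math.cos(2 * math.pi * th) for a, th in zip(amps, theta))
    s = sum(a * math.sin(2 * math.pi * th) for a, th in zip(amps, theta))
    lc = sum(l * a * math.cos(2 * math.pi * th)
             for l, a, th in zip(freqs, amps, theta))
    ls = sum(l * a * math.sin(2 * math.pi * th)
             for l, a, th in zip(freqs, amps, theta))
    return (lc * c + ls * s) / (c * c + s * s)


def z(t, amps, freqs, phases):
    """The trigonometric sum z(t)."""
    return sum(a * cmath.exp(2j * math.pi * (l * t + al))
               for a, l, al in zip(amps, freqs, phases))


def phi_prime(t, amps, freqs, phases, h=1e-6):
    """Central-difference estimate of the derivative of arg z(t) / (2 pi)."""
    ratio = z(t + h, amps, freqs, phases) / z(t - h, amps, freqs, phases)
    return cmath.phase(ratio) / (2 * math.pi * 2 * h)


amps = [1.0, 0.8, 0.5]
freqs = [1.0, math.sqrt(2), math.sqrt(3)]
phases = [0.1, 0.2, 0.3]
t0 = 0.37
theta0 = [l * t0 + al for l, al in zip(freqs, phases)]
print(chi(theta0, amps, freqs), phi_prime(t0, amps, freqs, phases))
```

The two printed numbers agree to the accuracy of the finite difference, as long as z(t₀) is not close to zero.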


Suppose that the frequencies $\lambda_k$ of (1) are linearly independent and let
$Z$ denote the curve on the torus (12) which is defined by the parameter
representation

$$Z: \quad \theta_k = \lambda_k t + \alpha_k \pmod 1; \qquad (k = 1, \cdots, n), \tag{14}$$

where the parameter $t$ runs from 0 to $+\infty$ and the phases $\alpha_k$ are arbitrarily
fixed.

First, it will be shown that the derivative $\varphi'(t)$ of the angular function
$\varphi(t)$ of the function (1) has an asymptotic distribution function $\sigma(\xi)$ and
that

$$\sigma(\xi) = \text{meas}\,\Gamma_\xi, \tag{15}$$

where meas $\Gamma_\xi$ denotes the n-dimensional $(\theta_1, \cdots, \theta_n)$-measure of the set
$\Gamma_\xi$ of those points $(\theta_1, \cdots, \theta_n)$ of the torus (12) at which the function (11)
satisfies the inequality

$$\chi \le \xi. \tag{16}$$

In order to prove this, notice first that the set (13) at which the denominator
of (11) vanishes clearly has a vanishing Jordan content (a detailed
description of $N$ will be given below). It is also clear from (11) that $\Gamma_\xi$ has
for every $\xi$ a Jordan content. On the other hand, the linear independence
of the $\lambda_k$ implies that to distinct values of $t$ there belong distinct points of the
curve (14) on the torus (12). Furthermore, it is seen from (1), (10) and
(11) that

$$\chi(\lambda_1 t + \alpha_1, \cdots, \lambda_n t + \alpha_n) = \gamma(t) = \varphi'(t) \tag{17}$$

holds at those points of the curve (14) which do not lie on the subset (13)
of the torus (12), i.e., at which $z(t) \ne 0$. Now it is clear from (17) and
from the definition of $\Gamma_\xi$ that $\gamma(t) \le \xi$ holds if and only if that point of the
curve (14) to which $t$ belongs is a point of $\Gamma_\xi$. It follows, therefore, from the
Kronecker-Weyl approximation theorem that if $\{\xi; T\}$ denotes the sum of the
lengths of those subintervals of the interval $0 \le t \le T$ on which $\gamma(t) \le \xi$, then

$$\{\xi; T\}/T \to \text{meas}\,\Gamma_\xi, \qquad T \to +\infty.$$

This proves that $\gamma(t) = \varphi'(t)$ has the function (15) as asymptotic distribution
function.
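The role of the Kronecker-Weyl theorem can be illustrated on a simple set with a Jordan content. The sketch below is not the set used in the proof: it samples the curve θₖ = λₖt + αₖ (mod 1) on a two-dimensional torus at equally spaced times and counts how often the point falls in the box 0 ≤ θ₁ < ½, 0 ≤ θ₂ < ½, whose measure is ¼. The sampling replaces the sum of interval lengths by a count, which is an approximation made here for brevity; names and sample values are hypothetical.

```python
import math


def fraction_in_box(freqs, phases, upper, t_end, steps):
    """Fraction of sample times at which every coordinate lies below `upper`."""
    hits = 0
    for j in range(steps):
        t = t_end * (j + 0.5) / steps
        point = [(lam * t + al) % 1.0 for lam, al in zip(freqs, phases)]
        if all(x < u for x, u in zip(point, upper)):
            hits += 1
    return hits / steps


# Linearly independent frequencies 1 and sqrt(2); box of measure 1/4.
frac = fraction_in_box([1.0, math.sqrt(2)], [0.0, 0.0], [0.5, 0.5],
                       2000.0, 200000)
print(frac)
```

With linearly dependent frequencies, for instance 1 and 2, the curve closes up and the fraction need not approach the measure of the box.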

Next, it will be shown that the set $N$ defined by (13) is a closed, possibly
disconnected, $(n - 2)$-dimensional analytic manifold in the n-dimensional
torus $\Theta$ defined by (12), and that the manifold $N$ has no singularities or a
finite number of singular curves according as there does not or does exist at
least one permutation $(k_1, \cdots, k_n)$ of $(1, \cdots, n)$ such that

$$a_{k_1} + \cdots + a_{k_m} = a_{k_{m+1}} + \cdots + a_{k_n} \tag{18}$$

holds for some $m$. It will be seen from the proof that if $n = 2$ or $n = 3$, then
$N$ consists of at most two analytic simple closed curves without singularities.
Incidentally, $N$ is, for an arbitrary $n$, empty if and only if the condition (6)
of Lagrange is satisfied. This is clear from the definition (13) of $N$.

In order to prove that $N$ has the structure described above, let $j$, $l$ be a pair
of distinct values of $k = 1, \cdots, n$ and let $J_{jl}$ denote the Jacobian with respect
to $\theta_j$, $\theta_l$ of the two functions $F$, $G$ occurring in the definition (13) of $N$, so that

$$J_{jl} = \frac{\partial(F, G)}{\partial(\theta_j, \theta_l)} = 4\pi^2 a_j a_l \sin 2\pi(\theta_l - \theta_j). \tag{19}$$

Accordingly, a point $P = (\theta_1, \cdots, \theta_n)$ of the subset $N$ of the torus $\Theta$ is a
singular point of the manifold $N$ if and only if

$$J_{jl} = 0, \quad \text{where } j, l = 1, \cdots, n; \ j \ne l. \tag{20}$$


Suppose that there exists on N a point P = (%,,- - -,%,) satisfying (20). 
Then one can arrange the numbers #,,- - -,%, into two groups 


such that 


(21a) vi, —Vi, =4; 
while 


Now it is clear that the pair of conditions (21a), (21b) defines on the torus $\Theta$
a simple closed curve, and that this simple closed curve lies, in view of (20)
and (13), on the $(n - 2)$-dimensional manifold $N$ if and only if the amplitudes
$a_k$ satisfy (18).

In what follows, it will be assumed for the sake of brevity that the amplitudes
$a_k$ of (1) do not satisfy an equation of the form (18), i.e., that the
manifold $N$ is free of singularities, so that no point of $\Theta$ which satisfies (20)
is a point of $N$.

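In order to prove that the space-average occurring on the left-hand side
of (7) is finite for the distribution function (15), one has, according to the
definition of $\Gamma_\xi$, merely to prove that the function (11) is absolutely integrable
over the torus $\Theta$. Actually,

$$\int_\Theta |\chi(\theta_1, \cdots, \theta_n)|\,d\theta_1 \cdots d\theta_n < +\infty. \tag{22}$$

The proof of (22) proceeds as follows: Let

$$P^0: \quad \theta_k = \theta_k^0; \qquad (k = 1, \cdots, n), \tag{23}$$

be a point of the manifold $N$, so that, since $N$ is free of singularities, (20) is
not satisfied by $(\theta_1, \cdots, \theta_n) = (\theta_1^0, \cdots, \theta_n^0)$, and so there exists at least
one pair $j$, $l$, say $j = 1$ and $l = 2$, such that

$$2|\theta_1^0 - \theta_2^0| \not\equiv 0 \pmod 1, \ \text{i.e.,} \ \cos 2\pi(\theta_1^0 - \theta_2^0) \ne \pm 1. \tag{24}$$

Now the quadratic form

$$Q(u, v; \theta_1, \theta_2) = 4\pi^2[a_1^2u^2 + 2a_1a_2\cos 2\pi(\theta_1 - \theta_2)\,uv + a_2^2v^2] \tag{25}$$

is such that for a sufficiently small $\epsilon > 0$ and for some positive $\eta = \eta(\epsilon; P^0)$
one has

$$Q(u, v; \theta_1, \theta_2) \ge (u^2 + v^2)\eta, \quad \text{if } |\theta_1 - \theta_1^0| < \epsilon, \ |\theta_2 - \theta_2^0| < \epsilon. \tag{26}$$

This follows for reasons of continuity from the fact that $Q(u, v; \theta_1^0, \theta_2^0)$ is
positive definite in view of (24) and (25). On the other hand, if a point

$$P^*: \quad (\theta_1^*, \cdots, \theta_n^*) \tag{27}$$

is on $N$, then it is seen from (13) and from (25) that

$$[F(\theta_1^* + u, \theta_2^* + v, \theta_3^*, \cdots, \theta_n^*)]^2 + [G(\theta_1^* + u, \theta_2^* + v, \theta_3^*, \cdots, \theta_n^*)]^2 = Q(u, v; \theta_1^*, \theta_2^*) + o(u^2 + v^2) \ \text{as } u^2 + v^2 \to 0, \tag{28}$$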

and that the o-term holds uniformly for all choices of the point (27) on $N$.
Hence it is seen from (26) that for a sufficiently small $\delta > 0$ and for some
positive $\zeta = \zeta(\delta; P^0)$ one has

$$[F(\theta_1^* + u, \theta_2^* + v, \theta_3^*, \cdots, \theta_n^*)]^2 + [G(\theta_1^* + u, \theta_2^* + v, \theta_3^*, \cdots, \theta_n^*)]^2 \ge (u^2 + v^2)\zeta \tag{29}$$

whenever (27) is a point of $N$ such that

$$|\theta_1^* - \theta_1^0| < \delta, \quad |\theta_2^* - \theta_2^0| < \delta, \quad \text{while } u^2 + v^2 < \delta^2. \tag{30}$$

Now it is clear from (11) and (13) that

$$|\chi(\theta_1, \cdots, \theta_n)| \le \text{const.}\,(F^2 + G^2)^{-1/2} \tag{31}$$

for all points $(\theta_1, \cdots, \theta_n)$ of $\Theta$ which do not lie on $N$. It is seen from (29)
and (31) that

$$|\chi(\theta_1^* + u, \theta_2^* + v, \theta_3^*, \cdots, \theta_n^*)| = O([u^2 + v^2]^{-1/2}), \qquad u^2 + v^2 \to 0, \tag{32}$$

holds uniformly with respect to $P^*$, if $P^0$ is fixed and (30) is satisfied. This
clearly implies that the contribution of a sufficiently small vicinity of $P^0$ to
the n-fold integral (22) is finite. Since $P^0$ is any point of the closed, bounded,
$(n - 2)$-dimensional manifold (13), and since this manifold consists of the
zeros of the denominator of (11), the proof of (22) is complete.

The admissibility of the identification of the space-average with the
time-average as mentioned at the beginning of the paper may be treated as follows:
Consider the transformation

$$\tau_t = \tau_t(\theta_1, \cdots, \theta_n)$$

which sends a point $(\theta_1, \cdots, \theta_n)$ of $\Theta$ into the point

$$(\lambda_1 t + \theta_1, \cdots, \lambda_n t + \theta_n)$$

of $\Theta$. Thus $\tau_t$ is a measure-preserving transformation of $\Theta$ into itself, satisfies
the group condition $\tau_t\tau_s = \tau_{t+s}$, and is of the metrically transitive type⁸ in view
of the linear independence of the $\lambda_k$. Hence the ergodic theorem of Birkhoff⁷
is applicable to every L-integrable function $v = v(\theta_1, \cdots, \theta_n)$ on $\Theta$ and thus,
according to (22), to the function $\chi$. Hence on excluding from $\Theta$ a set
of points $(\alpha_1, \cdots, \alpha_n)$ of measure zero, the time-average (5) of the function

$$g(t) = \chi(\lambda_1 t + \alpha_1, \cdots, \lambda_n t + \alpha_n)$$

exists and is equal to the integral of the function (11) over the torus $\Theta$. This
means in view of (15), (17) and (1) that on keeping the frequencies $\lambda_k$ and
the amplitudes $a_k$ fixed and on excluding from the n-dimensional space of the
phases $\alpha_k$ a set of measure zero, (7) holds for the derivative $\varphi'(t)$ of the
angular function $\varphi(t)$ of (1) and for the asymptotic distribution function
$\sigma(\xi)$. Finally, (3) follows from (7) in view of (8).

It may be mentioned that on excluding, for fixed $a_k$ and $\lambda_k$, a set of phases
$(\alpha_1, \cdots, \alpha_n)$ which is $(n - 1)$-dimensional, hence of measure zero, the function
(1) is distinct from zero for every $t$, so that one can choose in (2) the
sign + for every $t$, in which case $|z(t)|$ and $\varphi(t)$ become polar coördinates
in the $(x, y)$-plane. It is known⁹ that, in virtue of the linear independence
of the $\lambda_k$, the asymptotic distribution function of the polar angle $\varphi(t)$ thus
defined is a circular equidistribution, since the asymptotic distribution of (1)
in the $(x, y)$-plane is of radial symmetry.


⁹ Wintner [10].


THE JOHNS HOPKINS UNIVERSITY.


BIBLIOGRAPHY.


[1] G. D. Birkhoff, "Proof of the ergodic theorem," Proceedings of the National
Academy of Sciences, vol. 17 (1931), pp. 656-660.

[2] G. D. Birkhoff, "Probability and physical systems," Bulletin of the American
Mathematical Society, vol. 38 (1932), pp. 361-379.

[3] P. Bohl, "Über ein in der Theorie der säkularen Störungen vorkommendes Problem,"
Journal für die reine und angewandte Mathematik, vol. 135 (1909), pp.
189-283.

[4] G. H. Hardy and J. E. Littlewood, "On Lindelöf's hypothesis concerning the
Riemann zeta-function," Proceedings of the Royal Society, ser. A, vol. 103
(1923), pp. 403-412.

[5] B. Jessen and A. Wintner, "Distribution functions and the Riemann zeta function,"
Transactions of the American Mathematical Society, vol. 38 (1935), pp. 48-88.

[6] A. Khintchine, "Zu Birkhoffs Lösung des Ergodenproblems," Mathematische
Annalen, vol. 107 (1933), pp. 485-488.

[7] H. Weyl, "Sur une application de la théorie des nombres à la mécanique statistique
et la théorie des perturbations," Enseignement Mathématique, vol. 16 (1914),
pp. 455-467.

[8] A. Wintner, "Diophantische Approximationen und Hermitesche Matrizen,"
Mathematische Zeitschrift, vol. 30 (1929), pp. 290-319, more particularly pp. 312-316.

[9] A. Wintner, "Sur l'analyse anharmonique des inégalités séculaires fournies par
l'approximation de Lagrange," Rendiconti della R. Accademia Nazionale dei
Lincei, ser. 6, vol. 11 (1930), pp. 464-467.

[10] A. Wintner, "Upon a statistical method in the theory of diophantine approximations,"
American Journal of Mathematics, vol. 55 (1933), pp. 309-331.


ERRATA. 


In the paper of E. R. van Kampen and Aurel Wintner, “ On a symmetrical canonical 
reduction of the problem of three bodies,” American Journal of Mathematics, vol. 59 
(1937), pp. 153-166, read 18 instead of 9 in formula (55,) on p. 165 and in formula 
(59) on p. 166. 

In the paper of E. R. van Kampen and Aurel Wintner, “‘ Convolutions of distribu- 
tions on convex curves and the Riemann zeta function,” American Journal of Mathe- 
matics, vol. 59 (1937), pp. 175-204, 


page line . instead of read 
188 14 = €(e) = 
192 15 tangent normal 
192 16 the the normal 
192 18 2/ 2/ 
193 1 h k 

193 14 < > 


194 12 Max ( Max (G4, 


= 


ON AN ABSOLUTE CONSTANT IN THE THEORY OF 
VARIATIONAL STABILITY.* 


By E. R. VAN KAMPEN and AUREL WINTNER.


If $p(t)$, $-\infty < t < +\infty$, is a real continuous periodic function, the
linear differential equation

$$\frac{d^2x}{dt^2} + p(t)x = 0 \tag{1}$$

is known to possess two solutions $x = x_1(t)$, $x = x_2(t)$ of the form

$$x_1 = e^{\lambda t/T}f_1(t), \qquad x_2 = e^{-\lambda t/T}f_2(t), \tag{2}$$

where $f_1(t)$ and $f_2(t)$ do not vanish identically and are periodic with the same
period $T$ as $p(t)$. The numbers $\lambda$ and $-\lambda$, the characteristic exponents, are
determined mod $2\pi i$ by the reciprocal quadratic equation

$$e^{2\lambda} - 2Ae^{\lambda} + 1 = 0, \tag{3}$$

this equation being the characteristic equation of a real binary linear
substitution of determinant 1. If $\lambda$ is not a multiple of $\pi i$, it is clear that the
two solutions (2) are linearly independent. If $\lambda$ is a multiple of $\pi i$, it
depends on the elementary divisors of that binary substitution whether or not
the general solution of (1) is free of secular terms. The equation (1)
determined by the given periodic function $p(t)$ is said to be of the stable type
if every solution $x(t)$ remains bounded as $t \to \pm\infty$, i.e., if the elementary
divisors are simple and $\lambda$ lies on the imaginary axis of the $\lambda$-plane. Since
from (3)

$$e^{\pm\lambda} = A \pm (A^2 - 1)^{1/2},$$

it follows that

$$-1 \le A \le 1 \tag{4}$$

is necessary and that

$$-1 < A < 1 \tag{5}$$

is sufficient for the stability of (1). In fact, a multiple elementary divisor is
impossible unless there is a double root, so that $A^2 = 1$ is a necessary condition
for solutions with secular terms.
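The number A can be approximated directly by integrating the differential equation over one period. The sketch below is a minimal illustration, assuming that A is half the trace of the substitution carrying (x(0), x′(0)) into (x(T), x′(T)); this follows from the characteristic equation having determinant 1 and middle coefficient 2A. A classical fourth-order Runge-Kutta step is used, and all names are hypothetical.

```python
import math


def half_trace(p, period, steps=4000):
    """Approximate A = (x1(T) + x2'(T)) / 2 for x'' + p(t) x = 0."""
    h = period / steps

    def rhs(t, x, v):
        return v, -p(t) * x

    def propagate(x, v):
        t = 0.0
        for _ in range(steps):
            k1x, k1v = rhs(t, x, v)
            k2x, k2v = rhs(t + h / 2, x + h / 2 * k1x, v + h / 2 * k1v)
            k3x, k3v = rhs(t + h / 2, x + h / 2 * k2x, v + h / 2 * k2v)
            k4x, k4v = rhs(t + h, x + h * k3x, v + h * k3v)
            x += h / 6 * (k1x + 2 * k2x + 2 * k3x + k4x)
            v += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
            t += h
        return x, v

    x1, _ = propagate(1.0, 0.0)   # solution with x(0) = 1, x'(0) = 0
    _, v2 = propagate(0.0, 1.0)   # solution with x(0) = 0, x'(0) = 1
    return 0.5 * (x1 + v2)


# Constant coefficient p = 4 with period 1: the exact value is cos 2.
a_const = half_trace(lambda t: 4.0, 1.0)
print(a_const, math.cos(2.0), abs(a_const) < 1)
```

For a positive constant coefficient the computed value lies strictly between −1 and 1, in agreement with the sufficient condition.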

¹ Received February 22, 1937.

The determination of the solutions (2) in terms of $p(t)$ requires, in
general, an application of infinite determinants or of equivalent transcendental
processes. Even the determination of the characteristic exponents $\pm\lambda$ is quite
involved, it being defined by the zeros of Hill's fundamental determinant or
by the characteristic equation (3) of the monodromy matrix, i.e., by the
number $A$. The latter can be represented, according to Liapounoff,² by means
of the convergent series

$$A = 1 - A_1 + A_2 - A_3 + \cdots, \tag{6}$$

where, if $T > 0$ denotes the period of $p(t)$, the number $A_n$ is the definite
integral

$$A_n = \tfrac{1}{2}\int_0^T\!\!\int_0^{t_1}\!\!\cdots\!\int_0^{t_{n-1}} Q_n(t_1, \cdots, t_n)\,dt_n \cdots dt_2\,dt_1, \tag{7}$$

while the function $Q_n$ is defined in terms of the primitive function

$$P(t) = \int_0^t p(s)\,ds \tag{8}$$

of the coefficient $p(t)$ of (1) as follows:

$$Q_n(t_1, \cdots, t_n) = \{P(T) - P(t_1) + P(t_n)\}\{P(t_1) - P(t_2)\} \cdots \{P(t_{n-1}) - P(t_n)\}. \tag{9}$$

In particular, $Q_1$ is the constant $P(T)$, so that, from (7) and (8),

$$A_1 = \tfrac{1}{2}TP(T) = \tfrac{1}{2}T\int_0^T p(t)\,dt. \tag{10}$$
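The first two terms of the series can be evaluated by elementary quadrature. The sketch below is an illustration under stated assumptions: it takes A₁ = ½TP(T) and A₂ = ½∫∫Q₂ dt₂dt₁ over 0 ≤ t₂ ≤ t₁ ≤ T with Q₂ = {P(T) − P(t₁) + P(t₂)}{P(t₁) − P(t₂)}, and approximates the double integral by a midpoint rule. For a constant coefficient ω² with T = 1 the results can be compared with the Taylor coefficients ω²/2 and ω⁴/24 of cos ω. Names are hypothetical.

```python
def liapounoff_first_two(p, period, n=400):
    """Approximate (A1, A2) for the coefficient p(t) of given period."""
    h = period / n
    # Primitive function P at the grid nodes, by the trapezoidal rule.
    nodes = [0.0]
    for j in range(n):
        nodes.append(nodes[-1] + 0.5 * h * (p(j * h) + p((j + 1) * h)))
    total = nodes[n]                                   # P(T)
    # Approximate values of P at the cell midpoints.
    mids = [0.5 * (nodes[j] + nodes[j + 1]) for j in range(n)]
    a1 = 0.5 * period * total
    acc = 0.0
    for i in range(n):
        for j in range(i):                             # cells with t2 < t1
            acc += (total - mids[i] + mids[j]) * (mids[i] - mids[j])
    a2 = 0.5 * acc * h * h
    return a1, a2


a1, a2 = liapounoff_first_two(lambda t: 4.0, 1.0)      # omega = 2, T = 1
print(a1, a2)
```

With ω = 2 the partial sum 1 − A₁ + A₂ is already within about 0.1 of cos 2.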


Since the actual determination of $\pm\lambda$ requires, in view of (3), (6), (7),
(8), (9), highly complicated operations, it is natural to ask for criteria which
impose a less remote condition on the coefficient $p(t)$ of (1) and assure that
(1) is of the stable type. First, if $p(t)$ is a positive constant, (1) is clearly
of the stable type. On the other hand, if the positive periodic function $p(t)$
is not a constant, (1) need not be of the stable type. This holds also when
the deviation of $p(t) > 0$ from a constant is less than an arbitrarily small
$\epsilon > 0$. Examples to this effect are implied³ by the theory of Mathieu's
equation, where
(11) p(t) =c, cost 30<¢). 


² A. Liapounoff, "Sur une série relative à la théorie des équations différentielles
linéaires à coefficients périodiques," Comptes Rendus, vol. 123 (1896), pp. 1248-1252.
This investigation of Liapounoff is reproduced on pp. 425-431 of vol. IV, part III (1902)
of Forsyth's Theory of Differential Equations.

³ Cf. M. J. O. Strutt, "Wirbelströme im elliptischen Zylinder," Annalen der Physik,
vol. 84 (1927), pp. 485-506, where further references are given.


Now there exists, according to Liapounoff,² an absolute constant $\alpha > 0$
such that if a real continuous function $p(t)$ of period $T$ is non-negative for
every $t$, positive for some $t$ and satisfies the inequality

$$T\int_0^T p(t)\,dt \le \alpha, \tag{12}$$

then (1) is necessarily of the stable type. That $\alpha$ cannot be arbitrarily large,
is clear from the example of the Mathieu equation mentioned above. It is
indicated by Strutt's diagram,³ which has been calculated numerically for the
boundary curves of the stability region of (11) in the $(c_0, c_1)$-plane, that the
best possible value of the absolute constant $\alpha$ cannot be much greater than 5.
On the other hand, every $\alpha \le 4$ is admissible. In order to see this, it is,
according to Liapounoff, sufficient to observe that, as shown by (8), (9), (7)
and (10), the assumption

$$0 \le p(t) \not\equiv 0 \tag{13}$$


obviously implies the inequalities

$$A_{n+1} < \frac{A_1}{n + 1}A_n, \qquad A_n > 0. \tag{14}$$

In fact, if (12) is satisfied by an $\alpha \le 4$, then

$$0 < A_1 - A_2 + A_3 - \cdots < A_1 \le 2$$

in view of (10) and (14), so that the number (6) satisfies the sufficient
condition (5) of stability.

Now it will be shown that $\alpha = 4$ is the exact value of the absolute constant
in question. Thus there does or does not exist a continuous periodic function
$p(t)$ which satisfies (13), (12) and makes (1) a differential equation of the
unstable type according as $\alpha > 4$ or $\alpha \le 4$. It also will be shown that $\alpha = 4$
remains the greatest admissible value of $\alpha$ also when one restricts $p(t)$ to be
an even function of $t$. Finally, it will be seen from the proof that nothing is
gained if one requires that $p(t)$ is analytic or if one replaces (13) by $p(t) > 0$.

Needless to say, the greatest admissible value of $\alpha$ is independent of the
value of the period $T$. This is seen from (12) if one replaces $t$ in (1) by $ct$,
where $c > 0$ is arbitrary. On choosing

$$T = 1, \tag{15}$$


it follows that, since every $\alpha \le 4$ is admissible, the statement to be proved
may be formulated as follows: There exists for every $\epsilon > 0$ a real non-negative
continuous function $p(t) \not\equiv 0$ of period 1 such that, on the one hand, the mean
value of $p(t)$ satisfies the inequality

$$\int_0^1 p(t)\,dt \le 4(1 + \epsilon) \tag{16}$$

and, on the other hand, the characteristic exponents $\pm\lambda$ of (1) are not of the
stable type.
For two fixed numbers £, » which satisfy the inequalities 


(17) 0<p<t, B>0, 
put 
p(t) = (u—t)B/p, if OStSp and p(t) —0, if »StSh, 


(18) —p(1—2), if << <1, finally p(t +1) p(t), 


so that p(t) #0 is an even, contiruous, non-negative function of period 1. 
It is easy to see that, for this $p(t)$,

(19) $A_1 = \tfrac{1}{2}\beta\mu$

and

(20) $A_2 < \beta^2\mu^3$.


In fact, (19) is clear from (10), (15) and (18). Furthermore, from (18),

$p(t) = 0$, if $\mu \le t \le 1 - \mu$;

hence, from (8),

$P(t_1) - P(t_2) = 0$, if $\mu \le t_2 \le t_1 \le 1 - \mu$,

and so, since

(21) $Q_2(t_1, t_2) = \{P(1) - P(t_1) + P(t_2)\}\{P(t_1) - P(t_2)\}$

in view of (9) and (15),

(22) $Q_2(t_1, t_2) = 0$, if $\mu \le t_2 \le t_1 \le 1 - \mu$.

Since (8) is a non-decreasing function in view of $p(t) \ge 0$, it is seen from
(21) that

$Q_2(t_1, t_2) \le \{P(1)\}^2$, whenever $0 \le t_2 \le t_1 \le 1$.

Since $P(1) = \beta\mu$ in view of (10), (15) and (19), it follows that

(23) $Q_2(t_1, t_2) \le \beta^2\mu^2$ whenever $0 \le t_2 \le t_1 \le 1$.


Now if $S$ denotes that portion of the triangle $0 \le t_2 \le t_1 \le 1$ in the $(t_1, t_2)$-plane
on which $\mu \le t_2 \le t_1 \le 1 - \mu$ does not hold, then it is seen from (7),
(15), (22) and (23) that

$\int_0^1\!\int_0^{t_1} Q_2(t_1, t_2)\,dt_2\,dt_1 = \iint_S Q_2(t_1, t_2)\,dt_2\,dt_1 \le \beta^2\mu^2 \iint_S dt_2\,dt_1.$


This proves (20), since, by the definition of the region $S$,

$\iint_S dt_2\,dt_1 = \text{area of } S < 2\mu.$


Now let $\varepsilon > 0$ in (16) be given. Since only small values of $\varepsilon$ need to be
considered, one can assume that $\varepsilon < 4$ and then choose the numbers $\beta$ and $\mu$,
which occur in (17) and define the periodic peak function $p(t) \ge 0$, in such
a way that on the one hand

(24) $\beta\mu = 4(1 + \varepsilon)$

and on the other hand $\mu < \varepsilon/(4 + 4\varepsilon)^2$. Since the latter inequality implies,
in view of (20), that

$A_2 < \varepsilon\beta^2\mu^2/(4\varepsilon + 4)^2,$

it is seen from (24) and from the assumption $\varepsilon < 4$ that

(25) $A_2 < \varepsilon < 4.$

On the other hand,

(26) $A_1 = 2(1 + \varepsilon)$


in view of (19) and (24). Since, by (25), (26) and (14), 


it is clear that 
0< A,—A; + < Ao. 


Hence $A < 1 - A_1 + A_2$ in view of (6). It follows, therefore, from (25)
and (26) that

$A < 1 - A_1 + \varepsilon = 1 - 2(1 + \varepsilon) + \varepsilon < -1.$


Consequently, (4) is not satisfied. Since (4) is a necessary condition for
characteristic exponents $\pm\lambda$ of the stable type, and since (16) is satisfied
in view of (10), (15) and (26), the proof is complete.
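
The construction can be checked numerically. The sketch below is not part of the paper: it assumes that equation (1), which lies outside this excerpt, is $x'' + p(t)x = 0$, and that the number $A$ of (6) is half the trace of the monodromy matrix over one period, so that stability corresponds to $|A| < 1$. The function names and the Runge-Kutta step count are illustrative choices.

```python
def peak(t, beta, mu):
    """Peak function (18): triangular bumps of height beta, half-width mu, at the integers."""
    s = t % 1.0
    if s > 0.5:
        s = 1.0 - s  # evenness: p(t) = p(1 - t)
    return beta * (mu - s) / mu if s < mu else 0.0


def half_trace(beta, mu, steps=4000):
    """Half the trace of the monodromy matrix of x'' + p(t) x = 0 (assumed form of (1)),
    obtained by classical fourth-order Runge-Kutta integration over one period."""
    h = 1.0 / steps

    def flow(x, v):
        for k in range(steps):
            t = k * h
            k1x, k1v = v, -peak(t, beta, mu) * x
            k2x, k2v = v + 0.5 * h * k1v, -peak(t + 0.5 * h, beta, mu) * (x + 0.5 * h * k1x)
            k3x, k3v = v + 0.5 * h * k2v, -peak(t + 0.5 * h, beta, mu) * (x + 0.5 * h * k2x)
            k4x, k4v = v + h * k3v, -peak(t + h, beta, mu) * (x + h * k3x)
            x += h * (k1x + 2 * k2x + 2 * k3x + k4x) / 6.0
            v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6.0
        return x, v

    x1, _ = flow(1.0, 0.0)  # solution with x(0) = 1, x'(0) = 0
    _, v2 = flow(0.0, 1.0)  # solution with x(0) = 0, x'(0) = 1
    return 0.5 * (x1 + v2)


eps = 0.5
mu = 0.01                        # satisfies mu < eps / (4 + 4*eps)**2 = 1/72
beta = 4.0 * (1.0 + eps) / mu    # relation (24)
unstable = half_trace(beta, mu)  # expected below -1 by the argument above
stable = half_trace(3.9 / mu, mu)  # mean value 3.9 < 4, Liapounoff's stable range
print(unstable, stable)
```

With the mean value of $p$ slightly below 4 the half-trace stays inside $(-1, 1)$, while the choice (24) pushes it below $-1$, which is the dichotomy the proof establishes.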


THE JOHNS HOPKINS UNIVERSITY. 




ON THE EXPANSION OF THE REMAINDER IN THE OPEN-TYPE 
NEWTON-COTES QUADRATURE FORMULA.* 


By Orville G. Harrold, Jr.


1. The result that the remainder in the Newton-Cotes quadrature formula
can be expanded in a series of the Euler-Maclaurin type has been established
by J. V. Uspensky.¹ Inasmuch as the open-type quadrature formula of this
kind discussed by J. F. Steffensen² is important in the numerical integration
of differential equations, the question has been raised as to whether or not an
analogous development holds for the open formula. The answer is in the
affirmative.


2. We consider, without loss of generality, the integration interval (0,1).
The unit interval is divided into $n$ equal parts. The function $f(x)$ to be integrated
over this interval is assumed known at $x = 1/n, 2/n, \cdots, (n-1)/n$.
The coefficients, or weights, in the quadrature formula will be denoted by $A_i$:


$$A_i = \int_0^1 \frac{\omega_n(x)\,dx}{(x - i/n)\,\omega_n'(i/n)}, \qquad \omega_n(x) = (x - 1/n)(x - 2/n)\cdots(x - (n-1)/n).$$
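
As an illustration of the definition of the weights, the following sketch (not taken from the paper; the helper names are hypothetical) integrates each Lagrange basis polynomial on the nodes $i/n$ exactly with rational arithmetic.

```python
from fractions import Fraction


def poly_mul(p, q):
    """Multiply two polynomials given as coefficient lists in ascending powers."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def open_newton_cotes_weights(n):
    """Weights A_1, ..., A_{n-1} of the open formula on (0, 1) with nodes i/n."""
    nodes = [Fraction(i, n) for i in range(1, n)]
    weights = []
    for i, xi in enumerate(nodes):
        basis = [Fraction(1)]
        for j, xj in enumerate(nodes):
            if j != i:
                d = xi - xj
                basis = poly_mul(basis, [-xj / d, Fraction(1) / d])  # factor (x - xj)/(xi - xj)
        # Integrate the basis polynomial over (0, 1) term by term.
        weights.append(sum(c / (k + 1) for k, c in enumerate(basis)))
    return weights


for n in (3, 4, 5, 6):
    print(n, [str(w) for w in open_newton_cotes_weights(n)])
```

The weights sum to 1 and are symmetric, $A_i = A_{n-i}$, a fact used repeatedly in what follows.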


It is convenient to introduce the symbol
$$K_n^{\nu} = A_1 B_\nu(1/n) + A_2 B_\nu(2/n) + \cdots + A_{n-1} B_\nu((n-1)/n),$$
where $B_\nu(x)$ is the Bernoullian polynomial of degree $\nu$.


3. With the above notations, the result of this investigation can be 
formulated more explicitly as follows: 


Let $f(x)$ be a continuous function on $0 \le x \le 1$, with as many continuous
derivatives as are needed in the discussion; then


* Received November 10, 1936; revised December 28, 1936. 

¹ J. V. Uspensky, “On the expansion of the remainder in the Newton-Cotes formula,”
Transactions of the American Mathematical Society, vol. 37 (1935), pp. 381-396.

² J. F. Steffensen, Interpolation. Williams and Wilkins Co., Baltimore, 1927.




1 n-1 m+ée-1 (0) 
Aif(i/n) — > t 
0 4=1 v=m=[n/2] ( v) 
(28+2m) 
= — K,,28+2m f (€) 0 < E 


(2s + 2m)! 


$s$ being an arbitrary positive integer. Further, if the even ordered derivatives
keep their sign on $0 \le x \le 1$ and if their signs are all alike, then when the
series is truncated after a certain term, the error committed is of the same sign
as the next term and in absolute value is less than it.


4. By the Euler-Maclaurin summation formula 


B,(z) is the Bernoullian periodic function of order v, r being an arbitrary 


positive integer. Fixing 20 throughout this discussion and allowing 6 to 
take on successively the values 1/n, 2/n,- - -, (n—1)/n, we get 


Multiplying by $A_i$ and summing from $i = 1$ to $n - 1$, there results (since
J, fat Agfli/n) — afr” (0) 
{A,B,(1/n —t) + A2B,(2/n—t) 
4+ +++ An, Br((n—1)/n—t) }dt. 


For 2m —1, m= [n/2] 


1 
(*) Ky — By(2) de —=0. 

0 
Since $A_i = A_{n-i}$ and $B_{2\nu+1}(i/n) = -B_{2\nu+1}((n-i)/n)$ for all values of $\nu$,
we have


$K_n^{2\nu+1} = 0.$
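
The vanishing of $K_n^{\nu}$ for $\nu \le 2m - 1$ and for odd $\nu$ can be verified exactly for small $n$. The sketch below is an illustration only; the function names are hypothetical, and the Bernoulli numbers follow the convention $B_1 = -1/2$.

```python
from fractions import Fraction
from math import comb


def open_weights(n):
    """Exact open Newton-Cotes weights on (0, 1) with nodes i/n, i = 1, ..., n-1."""
    nodes = [Fraction(i, n) for i in range(1, n)]
    weights = []
    for i, xi in enumerate(nodes):
        basis = [Fraction(1)]  # ascending coefficients of the Lagrange basis polynomial
        for j, xj in enumerate(nodes):
            if j != i:
                d = xi - xj
                new = [Fraction(0)] * (len(basis) + 1)
                for k, c in enumerate(basis):
                    new[k] += c * (-xj / d)
                    new[k + 1] += c / d
                basis = new
        weights.append(sum(c / (k + 1) for k, c in enumerate(basis)))
    return weights


def bernoulli_numbers(count):
    """B_0, ..., B_{count-1} from the recurrence sum_{k<=m} C(m+1, k) B_k = 0."""
    B = [Fraction(1)]
    for m in range(1, count):
        B.append(-sum(comb(m + 1, k) * B[k] for k in range(m)) / (m + 1))
    return B


def bernoulli_poly(nu, x):
    """Bernoulli polynomial B_nu(x) = sum_k C(nu, k) B_k x^(nu - k)."""
    B = bernoulli_numbers(nu + 1)
    return sum(comb(nu, k) * B[k] * x ** (nu - k) for k in range(nu + 1))


def K(n, nu):
    """The symbol K_n^nu = sum_i A_i B_nu(i/n)."""
    return sum(a * bernoulli_poly(nu, Fraction(i, n))
               for i, a in enumerate(open_weights(n), start=1))


for nu in range(1, 9):
    print(nu, K(4, nu))
```

For $n = 4$ (so $m = 2$) the first non-vanishing value is $K_4^4$, and the following even-order values change sign, in line with assertion 1° of § 4.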


Thus the integral may be presented, for r = 2s, 


f(t)dt Auf (k/n) _ Kn2” Afr? (0) 


(at 


where 


Aa (Bas i/n — t) — Bay (i/n) 


To establish the result asserted in § 3 it suffices to show: 


1°, the numbers $K_n^{2m}, K_n^{2m+2}, \cdots$ alternate in sign;

2°, the quantity $\sum_{i=1}^{n-1} A_i\{\bar{B}_{2s}(i/n - t) - B_{2s}(i/n)\}$ keeps its sign on
$0 < t < 1$.


For if 2° is true, we may write, by virtue of (*) 


(é) 
Rosy = (2s) ! Kn O< € < 1. 


5. To establish the points in question the following properties of 


$$Q_k(t) = \sum_{i=1}^{n-1} A_i\{\bar{B}_k(i/n - t) - B_k(i/n)\}$$


are noted:


(1) $Q_k(t) = (-1)^k Q_k(1 - t)$;


(2) Qx(t) is continuous for k = 2,3,---, Q.(t) possesses derivatives 
of orders 1,2,---,k—1,Q.%*(t) is not differentiable at 
t=—1/n; 


(3) = (2 —1) —2)Qae-a(t), 
(k= 2,3,° (ti/n, k—=2)), 


(t) — 2h (2k —1) + 
(4) Qx(0) = = 0, (k= 1, 


Let $\alpha_k$ and $\beta_k$ denote respectively the number of distinct zeros of $Q_{2k-1}(t)$ and
$Q_{2k}(t)$ on $0 < t < 1$. By virtue of (1), for $k = 1, 2, \cdots$,


(5) $\alpha_k \ge 1$.




From the fact that $Q_{2k}(t)$ has $\beta_k + 2$ distinct zeros on $0 \le t \le 1$, we get,
by the use of (3) and Rolle's theorem, that $Q_{2k-1}(t)$ has at least $\beta_k + 1$ distinct
zeros on $0 < t < 1$; thus,


(6) $\beta_k + 1 \le \alpha_k$.


Due to the fact that $Q_{2k-1}(t)$ has $\alpha_k + 2$ distinct zeros on $0 \le t \le 1$, we get
by (3) and repeated application of Rolle's theorem that $Q_{2k-3}(t)$ has at least
$\alpha_k$ distinct zeros on $0 < t < 1$; hence,


(7) $\alpha_k \le \alpha_{k-1}$.


If it can be shown that $\alpha_m = 1$, then, by (6) and (7), $\beta_m, \beta_{m+1}, \cdots$ are all
zero. Thus, one of our contentions, namely, that

$$\sum_{i=1}^{n-1} A_i\{\bar{B}_{2s}(i/n - t) - B_{2s}(i/n)\}$$

keeps its sign on $0 < t < 1$, will be established.
6. Consider the function $Q_{2m-1}(t) = \sum_{i=1}^{n-1} A_i\{\bar{B}_{2m-1}(i/n - t) - B_{2m-1}(i/n)\}$.


n-1 1 
0 


we may make the following simplifications: 


Qom-1(t) = LAiBom-1(i/n —t) = Ai —t) +2 A:Bom-s(t/n —t); 
i=1 i/n> 
now 
n-1 n-1 
Ai Bom-1(1/n — t) > Ai Bom-1(1/n— t) 
i/n>t i/n>t 
n-1 
Ai Bom-1(1/n — #) — — t), 
i=1 
hence 


Qon-a(t)—= SA Bons (i/n—t) + Ai{Ban-s(1 +i/n—t)— Bans (i/n—t)} 


—SA Ben (i/n — Ai(1/n— t)2m-2 (2m — 


i/nsSt 


1 
The first term on the right is f (« —t)da = — t?™", By comparison of 
0 


with 
n-1 
f, = +B 
0 i=1 


we see that Ry(t) = — Qom-1(t) /(2m — 1) is precisely the remainder obtained 
when the open formula is applied to 


(x —t)?m-, t, < t < i, 


Setting 


— ])kf2m-k-1 

let the number of distinct zeros of $R_k(t)$, for $k = 0, 1, \cdots, 2m-3$, be denoted
by $N_0, N_1, \cdots, N_{2m-3}$. Let the number of variations in sign of $R_{2m-2}(t)$
as $t$ varies from 0 to 1 be denoted by $N_{2m-2}$. Evidently $R_k(t)$ is continuous
$(k = 0, 1, \cdots, 2m-3)$ and possesses a derivative $R'_k(t) = -(2m - k - 2)R_{k+1}(t)$,
$(k = 0, 1, \cdots, 2m-4)$; but $R_{2m-3}(t)$ does not possess a derivative at $t = i/n$.

By the property of the open quadrature formula

$R_k(0) = R_k(1) = 0;$

hence, $R_k(t)$ has $N_k + 2$ distinct zeros on $0 \le t \le 1$, so that, by Rolle's
theorem, and the fact that $R'_k(t) = -(2m - k - 2)R_{k+1}(t)$, we get

(A) $N_k + 1 \le N_{k+1},$

from which it follows that

$N_0 + (2m - 3) \le N_{2m-3}.$

In particular, $N_{2m-3} + 1 \le N_{2m-2}$. From the fact that the coefficients $A_i$
alternate in sign for $i \le m$,³ it follows that

$R_{2m-2}(t) = t - \sum_{i/n \le t} A_i$

can have at most $2m - 1$ variations in sign in $0 < t < 1$ when $n = 2m$. For,
we note $R_{2m-2}(t)$ cannot change sign more than twice in each of the subintervals
$(2i-1)/n \le t < (2i+1)/n$, while it definitely does not change sign in
$0 < t < 1/n$, and has at most one change of sign in $(2m-1)/2m \le t < 1$.
Hence, for even $n$,

$N_{2m-2} \le 2m - 1,$


³ See Lemma 1 below ($n \ne 5$).




from which Nom: S2m—2, or No= +1. But thus 


tm = +1, Bm = Bn = 0. 
If n = 2m + 1 we use the relations 


Reom-2(t) =t — i/n = t, 
By(t) = 


which follow immediately from the definition of R(t). 
Two cases are distinguished : 


1°. $n = 4k + 1$. As before, $R_{2m-2}(t)$ does not change sign in $0 \le t < 1/n$,
and not more than twice in each $(2i-1)/n \le t < (2i+1)/n$, $i = 1, 2, \cdots,
k-1$. It changes sign at most once in $(m-1)/n \le t < m/n$ since $A_m = A_{2k}$
is negative by Lemma 1 and $R_{2m-2}(m/n)$ is obviously negative. Thus $R_{2m-2}(t)$
has at most $4(k-1) + 2 + 1 = 2m - 1$ alternations of sign on $0 < t < 1$;
hence, $N_{2m-2} \le 2m - 1$, $N_0 \le +1$, and as before, $N_0 = +1$.


2°. n=4k+ 3. Again, does not change sign in 0 St < 1/n, 
and not more than twice in each (21—1)/nSt < (21-+1)/n. However, 
three alternations are possible in (m—1)/nSt=(m-+1)/m so that 
Nom-2 = 2m-+1. It follows that Rom_3(¢t) has at most 2m zeros on 0 < ¢ < 1. 
But Rem-s(t) is an even function of ¢ with respect to t = 1/2, so if Rom_3(t) <0 
at ¢ = 1/2, Nom-s = 0 (mod 4); but Noms S 2m = 4k + 2, so that 
Nom-3 S 2m — 2, hence No = + 1. 


If Rem-s3(1/2) <0, an impossible situation arises. Since 
—Rons(t) = #/2 +E 


our assumption implies that 3A;(i/n) = 1/8, i/n < 1/2, which is (n3) 
false by Lemma 2 below. Thus in all cases ¢m 1, Bm =0 (n 483, 5).4 


7. The functions Qom(t), Qems2(t),° - do not change sign on 0 
They are periodic functions with continuous derivatives, and such that 
Q’2.(0) = 0, so that has the same sign as Q”.,(0). From § 5, 


n-1 


Q” (0) Ai Bomsox-2(1/n), 


and 


1 n-1 
0 


“9 deals with the expansions for n = 3, 5. 




It is thus evident that the coefficients $K_n^{2\nu}$ alternate in sign for fixed $n$,
and $\nu = m, m+1, \cdots$. Both assertions of § 4 have now been established.


8. It remains to establish the two lemmas mentioned in § 6. 


Lemma 1. The coefficients $A_i$ satisfy the following inequalities for
$i \le n/2$:
$A_i > 0$, $i$ odd,
$A_i < 0$, $i$ even,


while for $i > n/2$ we use the fact that $A_i = A_{n-i}$.


To demonstrate that the A; alternate in sign it is convenient to modify 
the notation slightly to indicate the dependence on n. Set 


Ay = Any = (1/n) dz, (k= * -,n—1), 


where 


2): 


Since Py(k) = 


(—1)*-*1 on 


Setting In— f" P,(x)dz, and recalling well-known formulas for Gamma 
0 


Ank 


functions, we find 


n-1 1 n } 
Jeu (—1) f &1(1— é)*"dé et log &/(1-£) sin dx. 
vis e 0 0 


z—k 


The last integral may be split up and presented as 


n-k 
[ve da = (— 1)*(&/(1—é))* gt log £/(1-€) dt 
0 


0 


h Sin a 
dz = 2/2 + arc tan + ) 


T 


Using the formula 


we get 




log £/(1-£) sin 


log €/(1-§) p(n-k)e 


-co +. 


00 2? 
Carrying out the substitution in J,, we get 


Jn = (—1) (k) — 
1 log §/(1-&) p(n-k) 
+r(n) { f 


1 log €/(1-) ke 
(— 1)**-1 Ja 


Since An, = 


can now write 


log £/(1-§) p(n-beq 
+ { (—1)" fe — 
0 


n 
1 log &/(1-§) ekt dx 
-1 n-k- 
+ 


The quantity within the brackets becomes, after change of variable in the 
inner integrals, 


1 1 fed (— 


If $k$ is odd and $\le n/2$, we see at once that $A_{n,k} > 0$ for $n$ even or odd. If $k$
is even,


and, since the integrand is everywhere non-negative for k S n/2, 


= k-1 -1 n-1 

or 


On integration of the inner integral 




>—1+(n n? + log?((1— €) /é) 


—(n— + log*((1—€)/E) 
| + log?((1 — €)/é) 


The term 


is less than 


which is less than 


(n 1) k-1 n-k-1 n-k-1 
fork = n/2. Hence the term 
1 % (1 &)**dé eX é)*dé 
is less than 1/kr*. Since 
+ log? aw + log?((1—€)/é)’ 
we have 
hence 
(—1)" n n] 7 
(B) 


+ By calculation, A,2,. and A,s,. are both negative quantities. Considering n odd 
and even separately, we see readily that the right side of (B) is positive for 
$n \ge 12$, and for even $k \le n/2$. Thus for all $k \le n/2$, $(n \ne 5)$,

$A_{n,2l+1} > 0, \quad A_{n,2l} < 0.$
For n less than 12 we refer to tables.? 
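
For small $n$ the sign pattern of Lemma 1 can also be recomputed directly instead of being read from tables. The following sketch is an illustration, not the author's procedure; `open_weights` and `alternates` are hypothetical helper names.

```python
from fractions import Fraction


def open_weights(n):
    """Exact open Newton-Cotes weights on (0, 1) with nodes i/n, i = 1, ..., n-1."""
    nodes = [Fraction(i, n) for i in range(1, n)]
    weights = []
    for i, xi in enumerate(nodes):
        basis = [Fraction(1)]  # ascending coefficients of the Lagrange basis polynomial
        for j, xj in enumerate(nodes):
            if j != i:
                d = xi - xj
                new = [Fraction(0)] * (len(basis) + 1)
                for k, c in enumerate(basis):
                    new[k] += c * (-xj / d)
                    new[k + 1] += c / d
                basis = new
        weights.append(sum(c / (k + 1) for k, c in enumerate(basis)))
    return weights


def alternates(n):
    """True if A_i > 0 for odd i and A_i < 0 for even i, for all i <= n/2."""
    A = open_weights(n)
    for i in range(1, n // 2 + 1):
        a = A[i - 1]
        if (i % 2 == 1 and a <= 0) or (i % 2 == 0 and a >= 0):
            return False
    return True


for n in range(3, 12):
    print(n, alternates(n))
```

The case $n = 5$ is the exception treated separately in § 9, since all four weights are positive there.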


LemmA 2. Ai(t/n) > 1/8, n=3 (mod 4) (n> 8). 
1/n<a 


To establish this inequality we start from an sapreetion for An, used in the 
proof of the previous lemma: 


It follows that 


m m(m + 1) as 


1 m (— 4. = 2m-k} 
7m + log?(((1—€)/é)t) 


1 
It suffices to show the double integral (second term) exceeds (2m +1)?" To 
this end we start with the identity, valid for p+ q —1, 


fora — x) "dx 
0 


(1 — 2) 
0 


Setting z= _— in the integrand of the numerator on the right side, we 


—3 
obtain 


yr) /2dy 
1)‘ 1)/2 


(n-3)/2]y 
0 


whence, putting p —— tq (so that g = (1—¢)~), 


(n-3) /2 


0 —y)"* 




Further simplification gives 


y™ dy 


m-1 


which upon differentiation leads to 


m-1 m-1 
—> (— 1) #{2m-1-k — 2m = (— 1) #¢2m-1-% 
k=0 


This relation may be written as 


m-1 
(— 1) *¢2m-k-1 
k=0 


To get the other terms, we notice 


2m-1 


(¢ — 1) 2m-1, 
so that 


m-1 
a o (1—y)™ 


and, again by differentiation 
m-1 
(% + 1) (1 — t)?™-2(1 — 2mt) 
+ mC™ om_ (1 — t)?m- (1 —2mt) (1 < > 1 


Denoting the polynomial in the inner integral in the formula for Anx 
by F(t), we have 


(1—t)?™2(1 — 2mt) 


Setting 


&=0 


it is evident that y(0) —0 and that y(t) > 0 for 0 < ¢ <1, since 


y(t) tert 


Thus, by integration, 


> 0. 


t t t 
$(t) -f (1—2)™#(1—2me)da + f de, 
0 0 0 
t 
—t(1—t)2m-1 4 y(«)dx > 0, 
0 
for0 <t< 1. This expression we use in the integral in (A): 


P(t)dt ¢(1) 
o + log?(((1—€)/é)t) + log’?((1— /é) 
*_(t)2 log(((1—€)/é)t) dt 
Jo [x + log?(((1—-€)/é)t) ]? 

¢(1) 
If 1/2 <<é< 1, log((1— é) /é)t < 0; if 0< €=1/2, log((1— /)t < 0 
for t << €/(1—€). We seek now upper bounds for the absolute values of the 

negative terms: 


(t)2 log(((1—é)/é)t) dt 
+ log?(((1—€)/é)t)]? ¢ 
p(t)? log(((1—€é)/é)t) dt 


1; 


0<€éS1/2. 


Observing that 
1 
$(t) 4+ mo, +e ~) 
1 — y™ dy 
2m-1 


we have, since the last two terms of the bracket are negative, 


2m +1 1 2m +1 


2m-1 1m m-1 m 


for 0<t<1. By virtue of these inequalities, the first integral is less in 
absolute value than 


1 2m + 1 
2m 2(2m—1) 


+ log? 


C™om-1 




and the second is less than 


Considering 


we see that this is greater than 


€1(1 — €)2mdg 1 | 2m+1_ €*(1— 
log?((1—€)/é) \2m T 3(2m—1) 1) ms) + log?((1— €)/&) 


f ea—s {gt (5) as, 


which exceeds 


— 
6) = + ((1— €) /é) 


2(2m —1) maf + log?( (1 — €)/é) =f (1 é) dé 


(2m + 1 ) m m-2 


+ 1)2?m+1 


The second, third, and fourth terms are respectively not greater than 


1 (m + 1) (2m + 1) 
(2m —1)2?™+1? 4n?m(m — 1) (2m —1)° 


Since 


0 log?((1—€&)/€) 2m + 2 +1) 


we have 


F(t) dt 
1—é 2m 


Im 1 2m+1 
Ym, orn +< 
C™om-1 (= are tan log (2m 1) 


me 2m (2m — 1) 
CO" 1 (m + 1)(2m +1) 
a” (2m — 1) 2?™*1 (m — 1) (2m — 1) 


The coefficient of the integral is 2m/(2m + 1)?; hence, we are concerned with 
the truth of the inequality 




arc tan ( ) 
2m \2m + 3) log(2m-+1)/7 __ 1 


(2m + 1)? 2m (2m — 1) (2m — 1) 221 
(m+ 1) (2m +1) > 1 
42?m(m—1)(2m—1) 8(2m + 1)?’ 
or 
2m + 1\?™1 
Gs) arc tan log(@m 1) 
2m(2m — 1) a? (2m —1)2?™+2 


(m + 1)(2m +1) 1 1 
4n?m(m — 1) (2m — 1) + Fm 


> 


which is fulfilled for m = 5. 
The inequality 3A;(i/n) >1/8 is also fulfilled for n—7% by direct 
calculation, but not for n = 3. 


9. The coefficients $A_{5,i}$ do not alternate in sign. Noting in particular
that (A), § 6, is valid,

$N_0 + 1 \le N_1.$

Since $N_1 + 1 \le N_2$, it follows that $N_0 + 2 \le N_2$, where $N_2$ is the number of
sign changes of $R_{2m-2}(t)$ as $t$ increases from 0 to 1 for $m = 2$. From tables
the graph of $R_{2m-2}(t) = t - \sum_{i/5 \le t} A_i$ presents precisely three changes of sign
on $0 < t < 1$. Hence,

$N_0 + 2 \le 3,$

or $N_0 = +1$, which was to be shown.


If n = 3, there are only two coefficients, which are equal, and hence there 
is no question of alternation of sign. The expansion has, in this case, the 
following form: 


afer» (0) 
(2v)! 


+ f Ai {Bos (i/3 — t) — Bos (4/3) Jat, 


> Aif (i/3) — 


where 
é 1-2y 
(1— 3) 


= A;By(1/3) + AsBuy(2/3) 


hence, $K_3^{2\nu}$ alternates in sign for $\nu = 1, 2, 3, \cdots$.




The mean value theorem can be applied as before to put the remainder
in the previously given form since

$Q_{2s}(t) = \tfrac{1}{2}\{\bar{B}_{2s}(1/3 - t) + \bar{B}_{2s}(2/3 - t) - 2B_{2s}(1/3)\}, \quad (s = 1, 2, \cdots),$

does not change sign on $0 < t < 1$. By direct methods this is evident, for on
$0 < t < 1/3$,

$Q_2(t) = t^2,$

and

$Q_3(t) = -t^3 + t/6.$

On $1/3 < t < 2/3$,

$Q_2(t) = t^2 - t + 1/3,$

$Q_3(t) = -t^3 + t/6 + (3/2)(1/3 - t)^2,$

and thus $\beta_1 = 0$, $\alpha_2 = 1$. From $\beta_s + 1 \le \alpha_s$, and $\alpha_s \le \alpha_{s-1}$, it is evident that

$\beta_2 = \beta_3 = \cdots = 0.$
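
The closed forms quoted for $n = 3$ can be checked with exact arithmetic. The sketch below is an illustration only: it assumes $Q_k(t) = \tfrac{1}{2}\{\bar{B}_k(1/3 - t) + \bar{B}_k(2/3 - t) - B_k(1/3) - B_k(2/3)\}$, where $\bar{B}_k$ is the periodic continuation of $B_k$, which reduces to the expression above when $k$ is even.

```python
from fractions import Fraction
from math import floor


def bernoulli_poly(k, x):
    """Bernoulli polynomials B_1, B_2, B_3 written out explicitly."""
    if k == 1:
        return x - Fraction(1, 2)
    if k == 2:
        return x * x - x + Fraction(1, 6)
    if k == 3:
        return x ** 3 - Fraction(3, 2) * x * x + Fraction(1, 2) * x
    raise ValueError("only k = 1, 2, 3 are provided in this sketch")


def bernoulli_periodic(k, x):
    """Periodic continuation: evaluate B_k at the fractional part of x."""
    return bernoulli_poly(k, x - floor(x))


def Q(k, t):
    """Q_k(t) for n = 3, where both weights equal 1/2."""
    a, b = Fraction(1, 3), Fraction(2, 3)
    return Fraction(1, 2) * (bernoulli_periodic(k, a - t) + bernoulli_periodic(k, b - t)
                             - bernoulli_poly(k, a) - bernoulli_poly(k, b))


t = Fraction(1, 4)
print(Q(2, t), t ** 2)               # both forms of Q_2 on 0 < t < 1/3
print(Q(3, t), -t ** 3 + t / 6)      # both forms of Q_3 on 0 < t < 1/3
```

On a grid of interior points $Q_2$ stays positive, and $Q_3$ vanishes at $t = 1/2$, consistent with $\beta_1 = 0$ and $\alpha_2 = 1$.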


STANFORD UNIVERSITY, 
CALIFORNIA. 




ON CERTAIN FUNDAMENTAL IDENTITIES DUE TO USPENSKY.* 


By W. A. Dwyer. 


1. Introduction. In a series of memoirs entitled “Sur les Relations
entre les Nombres des Classes des Formes Quadratiques Binaires et Positives,”¹
Uspensky has obtained several very general fundamental formulae involving
incomplete numerical functions in three variables. He has made use of these
to establish a great variety of interesting and useful arithmetical theorems.²
Uspensky's proofs of the fundamental formulae are purely arithmetic and of
great simplicity, but give no clue as to how a systematic determination of such
formulae may be made. They are in the nature of a priori verifications. The
structure of the identities suggests that they may be gotten from equivalent
identities involving the theta functions by means of the method of paraphrase.³
If such an identity can be found, it will suggest a systematic determination,
by analytical means, of all identities of this type. From this set of identities
it would then be possible to pass back, by means of the method of paraphrase,
to other general identities, and then to their application to arithmetic. Bell,⁴
for example, has discovered a theta function identity which paraphrases into a 
certain fundamental formula of Uspensky involving complete numerical func- 
tions. In this paper we shall establish two of the formulae involving incom- 
plete numerical functions as special cases of a general formula which, in turn, 
results from the paraphrase of a rather peculiar theta identity. 


2. Let $F(x, y, z)$ be a function defined for integral values of the arguments
and subject to the parity conditions


$F(-x, -y, -z) = -F(x, y, z), \quad F(0, 0, 0) = 0.$


Then there exists an identity involving incomplete numerical functions in 
three variables 


* Received January 26, 1937. 

¹ Bulletin de l'Académie des Sciences de l'U.S.S.R., 1925, 1926.

² J. V. Uspensky, loc. cit., Quatrième Mémoire, 1926, pp. 547-566. Also, American
Journal of Mathematics, vol. 50 (1928), pp. 93-122; Bulletin of the American Mathematical
Society, vol. 36 (1930), pp. 743-754.

³ E. T. Bell, Transactions of the American Mathematical Society, vol. 22 (1921),
pp. 1-30, and 198-219.

⁴ E. T. Bell, Bulletin of the American Mathematical Society, vol. 32 (1926), pp.
682-688.




I) 3F(8+i,8—d+i,i) 
+ ¢(n)T +a(n)L, 


with integral partitions 


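II) a) $n = i^2 + 2d\delta$, $i \gtreqless 0$, $\delta > 0$, $d > 0$,
b) $n = h^2 + \Delta\Delta'$, $h \gtreqless 0$, $0 < \Delta < \Delta'$, $\Delta' \equiv \Delta \pmod 2$,
c) $n = s^2$, $s > 0$, $\epsilon(n) = 0$ or 1 according as $n$ is not or is a perfect
square,
d) $n = r^2 + t^2$, $r > 0$, $t > 0$, $a(n) = 0$ or 1 according as $n$ is not
or is a sum of two squares.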
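
The partitions II can be enumerated mechanically for a given $n$. The sketch below is an illustration under the reading adopted here, namely that $i$ and $h$ run over all integers (positive, negative or zero); the function names are hypothetical.

```python
from math import isqrt


def partitions_a(n):
    """Triples (i, d, delta) with n = i^2 + 2*d*delta, i any integer, d > 0, delta > 0."""
    out = []
    for i in range(-isqrt(n), isqrt(n) + 1):
        rest = n - i * i
        if rest > 0 and rest % 2 == 0:
            half = rest // 2
            out += [(i, d, half // d) for d in range(1, half + 1) if half % d == 0]
    return out


def partitions_b(n):
    """Triples (h, D, D2) with n = h^2 + D*D2, 0 < D < D2, D and D2 of equal parity."""
    out = []
    for h in range(-isqrt(n), isqrt(n) + 1):
        rest = n - h * h
        D = 1
        while D * D < rest:
            if rest % D == 0 and (rest // D - D) % 2 == 0:
                out.append((h, D, rest // D))
            D += 1
    return out


def epsilon(n):
    """1 if n is a perfect square, else 0 (condition II c)."""
    return 1 if isqrt(n) ** 2 == n else 0


def two_square_pairs(n):
    """Pairs (r, t) with n = r^2 + t^2, r > 0, t > 0 (condition II d)."""
    return [(r, isqrt(n - r * r)) for r in range(1, isqrt(n) + 1)
            if n - r * r > 0 and isqrt(n - r * r) ** 2 == n - r * r]


print(partitions_a(5), partitions_b(5), epsilon(5), two_square_pairs(5))
```

Such a listing makes it easy to test a candidate identity numerically with a concrete odd function $F$ before attempting a proof.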


If parity conditions be restricted, corresponding identities result as follows: 


= F(a, y, 2), F(«,— y, 
= — F(z, y,z), F(x, 0,2) =0, 
III) 4 SF (8 + i,8— d +i,4) 


F(—za, y, 2) =— F(2,y,z), y, —2) = F(z, y, 2), F (2,9, z) = 0, 
SP(8 + i,8—d + i,i) 


2 


Formulae III and IV are the same as certain formulae discovered by Uspensky ⁵
and proved by purely arithmetic methods. 


3. The theta identity and its paraphrase. Formula I results from the 
attempt to find a theta-function identity which would paraphrase into III and 
IV. The procedure consisted in going backwards from these two, and involved 
the selection of terms, which when arithmetized would meet the conditions 
II(a) and II(b), and proper adjustment of the arguments. The left side of
III or IV suggests the product of a theta and a function of the type 


+ y) 
pave (2, y) 


⁵ J. V. Uspensky, Quatrième Mémoire, loc. cit., and “On incomplete numerical
functions,” Bulletin of the American Mathematical Society, loc. cit., p. 746.


The function desired, and its arithmetic equivalent,⁶ is


V) O(a + y,—y) 
+ {ctn(x + y) —ctn(y)}-°3 cos 


For the terms of III (or IV) involving incomplete numerical functions we 
shall employ an expression 


VI) => — rar) = ctn +2 gq” sin 2ny 


443 
0< n= d, 0<d< 3, § =d (mod 2), 


which appears as a term in the Fourier development of certain pseudo-periodic
functions.⁷ By synthesis, we arrived at the relation


VII) 
= 03(z)x(@ + y, + 2) + 2)x(y, 2). 


An independent proof of this result will appear in § 4 and we shall proceed 
with the paraphrase of VII. Applying to it the arithmetized expansions of V 


and VI and the formula #;(z) = > gq” cos(2nz), we obtain 


VII) sin 2[ (8+ i)e + (83—d +i)y + iz] 
= 45 gh sin[ (A’ + + (A’—A)y + 2(A—A)] 
—sin[— 2ha + (A’— A)y + 2(A—h)z]} 
+ ctn(x + y) {3 ¢*{cos(2iz) — cos 2i(a + y + z)}} 
— ctn(y) {3 2i(a + z) —cos + y + z)}} 
+ 2% S{cos(2iz)sin + 2) — cos 2i(x + z) sin 2¢z}, 


⁶ Cf. E. T. Bell, “Theta expansions useful in arithmetic,” Messenger of Mathematics,
vol. 53 (1924), pp. 166-176.

⁷ M. A. Basoco, “Fourier developments for certain pseudo-periodic functions in
two variables,” American Journal of Mathematics, vol. 54, no. 2 (1932), p. 242. In
this connection we shall exhibit the periodicity properties of $\chi(x, y)$. The function is
obviously periodic when $x$ or $y$ is increased by $\pi$. Furthermore


+ nr7,y + = 
x (a, y ) = e-2niry (a, y) 
n-1 
+- ie-2ntof q-n? e-2nil(y-a) +2 q-(n-k)? e-24(n-k) + 1} (y); 
k=1 
x (a NTT, y) = qn? e2nila-y) x (a, y) 
n-1 
— if qn? e2ni(a-y) + 2 qn*-k? e24(n-k) 1}9,(y)- 
k=1 




where the $n$ appearing in the $\sum_{n=1}$ term of VI has been replaced by $t$, and
$i, d, \delta, h, \Delta, \Delta', t$ are subject to the conditions II.

In the terms involving cotangents we change the index of summation
from $i$ to $s$ (bringing in the multiplier 2, since $i \gtreqless 0$ while $s > 0$), combine
the differences of cosines into the product of two sines, and apply the formula,

a-1 
sin(aw)ctn(w) = } cos(a— 2k)u. After combining terms and making an 
k=0 
obvious change in the index of summation, our expression contributes 
+ 4e(n)T +23 q{sin 2¢2 — sin 2t(x + z)}. 

If we split the last term of VIII into a sum corresponding to $i = 0$ and
a sum where $i = r$ ($r$ restricted to positive values), we obtain
+ 4a(n) ZL —23 q"{sin(2tz) — sin + z)}. 
Putting the last two results in VIII, changing all arguments to their half- 


values, and paraphrasing, we arrive at I. 


4. Proof of the theta identity. Consider the left-hand side as a func- 


tion of y alone. Then 
+ y + (2) 


f(y + nar) = f(y + nr) = f(y). 
The residues at the simple poles, y= 0 + nar, y= — ax + are respectively 
— q™ +2) and + Let C represent the contour 
in the y-complex plane composed of (n + 1) cells (of width) above and n cells 
below the real axis and consider the auxiliary function ¢(t) = f(t) /tan(t—y) 


which has poles at t= y, t= y +7, t=—a2+ nar. The residue 
at t= y is f(y). Derange the mesh so that poles lie within the boundary, 
apply Cauchy's Theorem to the auxiliary function around $C$, and allow $n$ to become infinite.⁸
Thus,


. 
si, o(t)dt = 0 = & Residues 


=f(y) + ge ctn(y — ner) 


n=-O0 


co 
ctn(a + y— ner). 


n=-CO 


The last relation is the same as our identity VII. 


⁸ Cf. M. A. Basoco, “Fourier developments for certain pseudo-periodic functions
in two variables,” loc. cit., pp. 244-245.




In his arithmetic proof of III and IV Uspensky splits the left-hand side
into three sums $S_1$, $S_2$, $S_3$ according as $(\delta - d + i)$ is $> 0$, $< 0$, or $= 0$,
and sets up a pair of transformations establishing a one-to-one correspondence
between the solutions of $i^2 + 2d\delta$ and those of $h^2 + \Delta\Delta'$ which obey the
restrictions II(a) and II(b). $S_1 + S_2$, together with stated parity conditions,
gives us identities III and IV. $S_3$ obviously vanishes because of the condition
$F(x, 0, z) = 0$. If, however, we change the parity conditions to agree with
those of I, the more general condition $F(0, 0, 0) = 0$ demands that we consider
the contribution of the term $S_3$. From II(a)


$n = i^2 + 2d\delta$. If $\delta - d + i = 0$, then $n = d^2 + \delta^2$.


Consequently

$F(\delta + i, \delta + i - d, i)$
is of the form

$S_3 = \sum F(d, 0, d - \delta),$


which is of the same form and has the same partitions as the term $a(n)L$
appearing in I.


5. In conclusion, we may point out that the results obtained by Uspensky 
from his fundamental formulae are implicitly contained in our theta identity. 
The theta identity suggests other similar products of the theta and ¢-functions 
which, when treated in an analogous manner, will lead to general fundamental 
formulae of the same type as those of Uspensky. 


UNIVERSITY OF NEBRASKA. 




AN EXTENSION OF BERNSTEIN’S THEOREM ASSOCIATED 
WITH GENERAL BOUNDARY VALUE PROBLEMS.* 


By W. H. McEwen. 


Introduction. Consider the n-th order differential system 


(1) $u^{(n)} + P_2(x)u^{(n-2)} + \cdots + P_n(x)u + \lambda u = 0,$

$U_j(u) = 0, \quad (j = 1, 2, \cdots, n),$

in which the functions $P_2(x), \cdots, P_n(x)$ are continuous and have continuous
derivatives of all orders on $a \le x \le b$, and the $U$'s are $n$ linearly independent
conditions involving $u^{(j)}(a)$, $u^{(j)}(b)$, $(j = 0, 1, \cdots, n-1)$. The general nature
of the solutions and the expansion problems connected with this system have
been discussed by Birkhoff,¹ Tamarkin,² Milne³ and Stone.⁴ Let the characteristic
values of the system, taken in the order of magnitude of their moduli,
be $\lambda_1, \lambda_2, \cdots$, and let $u_1(x), u_2(x), \cdots$ be the corresponding characteristic
solutions. The values $\lambda_k$ are then the poles of the Green's function of the
system. Assume that the boundary conditions are normalized and regular.⁵
Assume further that the values $\lambda_k$ give rise to simple poles of the Green's
function when $k$ is large.⁶


Let $S_N(x) = \sum_{j=1}^{N} a_j u_j(x)$ be an arbitrary linear combination of the solutions
corresponding to the first $N$ characteristic values, and let $L$ be the maximum
value of $|S_N(x)|$ on $a \le x \le b$. It is the purpose of this paper to
establish the following two theorems:

THEOREM 1. On the interval $a \le x \le b$


$|S'_N(x)| \le qN^2L,$


* Received October 12, 1936; revised January 25, 1937. 

¹ G. D. Birkhoff, “Boundary value and expansion problems etc.,” Transactions of
the American Mathematical Society, vol. 9 (1908), pp. 373-395.

² J. Tamarkin, Rendiconti del Circolo di Palermo, vol. 34 (1912), pp. 345-395.

³ W. E. Milne, Transactions of the American Mathematical Society, vol. 19 (1918),
pp. 143-156.

⁴ M. H. Stone, “A comparison of the Series of Fourier and Birkhoff,” Transactions
of the American Mathematical Society, vol. 28 (1926), pp. 695-761.

⁵ For definitions of these terms see Birkhoff, loc. cit., p. 382.

⁶ For a discussion of this assumption see footnote 12.


where $q$ is a positive constant independent of $N$.
THEOREM 2. On the interval $a + \delta \le x \le b - \delta$
$|S'_N(x)| \le QNL,$
where $Q$ is a positive constant independent of $N$.


These theorems are analogous respectively to the theorems of Markoff
and Bernstein as applied to polynomial sums. In connection with Theorem 2
it should be noted that the limit $QNL$ cannot in general be extended to the
end points $a$ and $b$, as may be shown by an example,⁷ although in certain
special cases the limit does apply uniformly to the whole interval. Examples
of the latter are the systems that give rise to sums of Fourier or Sturm-Liouville
type. The Sturm-Liouville case has been treated by Miss E. Carlson,⁸
who has also proved Theorems 1 and 2 for the case of a special 3rd order
system.⁹
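
The classical polynomial case shows why the bound is of order $N^2$ on the whole interval but only of order $N$ on an interior interval. The sketch below is an illustration, not part of the paper: it uses the Chebyshev polynomial $T_N$ on $[-1, 1]$, for which $\max|T_N| = 1$ and $T_N' = N U_{N-1}$, and checks the Markoff value $N^2$ at the end point together with Bernstein's interior bound $N/\sqrt{1 - x^2}$.

```python
import math


def cheb_T(N, x):
    """Chebyshev polynomial of the first kind, by the three-term recurrence."""
    a, b = 1.0, x
    if N == 0:
        return a
    for _ in range(N - 1):
        a, b = b, 2.0 * x * b - a
    return b


def cheb_dT(N, x):
    """Derivative T_N'(x) = N * U_{N-1}(x), with U the second-kind polynomials."""
    if N == 0:
        return 0.0
    a, b = 1.0, 2.0 * x  # U_0, U_1
    if N == 1:
        return 1.0
    for _ in range(N - 2):
        a, b = b, 2.0 * x * b - a
    return N * b


N = 5
grid = [-0.9 + 1.8 * k / 400 for k in range(401)]  # interior interval [-0.9, 0.9]
interior_max = max(abs(cheb_dT(N, x)) for x in grid)
print(cheb_dT(N, 1.0), interior_max, N / math.sqrt(1 - 0.9 ** 2))
```

At the end point the derivative equals $N^2$, while on $[-0.9, 0.9]$ it stays below a constant multiple of $N$, mirroring the distinction between Theorems 1 and 2.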

The proofs given here are based on a number of results to be found in
Professor Stone's paper.¹⁰ This paper will be referred to hereafter as (S).
The writer wishes to acknowledge his indebtedness to Professor Stone for 
valuable suggestions in connection with the form of presentation of the proofs. 


Preliminary discussion. We can assume, without loss in generality, that
the interval of $x$ is $0 \le x \le 1$, and that the maximum value of $|S_N(x)|$ on
(0,1) is 1.

Let $G(x, y; \lambda)$ be the Green's function of system (1). The characteristic
values of $\lambda$ are then the poles of $G$. The facts concerning the nature and
distribution of these values are well known. They form two infinite sequences
in the complex $\lambda$-plane, given asymptotically by the formulas ¹¹


(2) $\lambda'_k = -(2k\pi i)^n (1 + E'/k),$
$\lambda''_k = -(-2k\pi i)^n (1 + E''/k),$

⁷ The system $du/dx + \lambda u = 0$, $u(0) + u(1) = 0$ gives rise to sums $S_N(x)$ which
are sine and cosine sums in the variable $X = \pi x$ which ranges over a part only of a
period interval $0 \le X \le \pi$. It follows then from the well known form of Bernstein's
theorem relating to a trigonometric sum on a part of a period interval, that the limit
$QNL$ can be assigned only to an interval which is interior to $0 \le X \le \pi$, and hence to an
interval of $x$ which is interior to (0,1).

⁸ E. Carlson, Transactions of the American Mathematical Society, vol. 26 (1924),
pp. 230-240.

⁹ E. Carlson, Transactions of the American Mathematical Society, vol. 28 (1926),
pp. 435-447; pp. 439-447.

¹⁰ M. H. Stone, loc. cit.
¹¹ See Birkhoff, loc. cit., p. 383.




where $E'$, $E''$ are bounded functions of $k$. For large values of $k$, in accordance
with the assumption made on page 1,¹² the poles of $G$ are simple. Hence if
multiple poles exist they are limited in number. Let $C_N$ be a circle of the
$\lambda$-plane with centre at the origin which includes within its boundary the first
$N$ poles of $G$, $\lambda_1, \cdots, \lambda_N$, and no others. Then the sum $S_N(x)$ may be
represented identically by the contour integral ¹³


(3) = Sey) dy. 


provided the poles in question are all simple. If, however, certain of these
poles are multiple, the integral given above will represent a sum $\sigma_N(x)$ obtained
from $S_N(x)$ by replacing the terms corresponding to the multiple poles $\lambda_s$ by
terms of the form $\int_0^1 S_N(y) R_s(x, y)\,dy$, where $R_s(x, y)$ is the residue of $G$
at $\lambda = \lambda_s$. But the terms involved in this change are bounded independently
of $N$, and their number also is independent of $N$. Hence it follows that $S'_N$
and $\sigma'_N$ are of the same order of magnitude with respect to $N$. Thus, for the
purpose of our discussion, there is no loss in generality in assuming that
$S_N(x)$ is represented by (3).

A more useful form of (3) is obtained by placing $\lambda = \rho^n$. Under this
transformation the entire $\lambda$-plane is made to correspond to a sector $\Sigma$ in the
$\rho$-plane, composed of two adjacent sectors of the following set of $2n$ equal
sectors:


$S_l$: $l\pi/n \le \arg\rho \le (l+1)\pi/n$, $(l = 0, 1, \cdots, 2n-1)$.


The path of integration will then become the arc $\Gamma$ which the sector $\Sigma$ cuts
off from the circle with centre at the origin in the $\rho$-plane and radius equal to
the $n$-th root of the radius of $C_N$. Hence we can write


(4) Sy (x2) = if Sw(y) mp" "G(x, yse")dp | dy. 
Jo r 


¹² The assumption referred to is the one which demands that the poles of $G$ be
simple when $k$ is large. This condition is not highly restrictive inasmuch as it is
automatically satisfied if the system is regular and of odd order, and is in general
satisfied if the system is regular and of even order. In the case of even order, however,
it may happen that pairs of characteristic values coincide to give double values, and
these in turn will give rise to either simple or double poles of $G$. If the system is self-adjoint
the double values give rise to simple poles, but otherwise it is possible to have
infinitely many double poles. Tamarkin has given an example of a regular system,
$n = 2$, with infinitely many double poles (see Stone and Tamarkin, “Remarks on a
paper by Dr. Tautz,” Acta Mathematica, 1931). It is this type of system which our
hypothesis rules out.

¹³ See Birkhoff, loc. cit., p. 379.


297 


298 W. H. MCEWEN. 


A special case of (1) is the system (discussed in S, pp. 709-711)

$d^n u/dx^n + \lambda u = 0, \quad u^{(j)}(0) - u^{(j)}(1) = 0, \quad (j = 0, 1, \cdots, n-1),$

which gives rise to sums of Fourier type. Let $\bar G(x, y; \lambda)$ denote the Green's
function in this case. The arc $\Gamma$ may be drawn so as to avoid the poles of $\bar G$
as well as those of G. Then the integral

(5) $\frac{1}{2\pi i}\int_0^1 S_N(y)\Big[\int_\Gamma n\rho^{n-1}\bar G(x, y; \rho^n)\, d\rho\Big]\, dy$

defines a sum $T_N(x)$ which is a trigonometric sum of order $\bar N/2$ on the period
interval $0 \le x \le 1$, where $|N - \bar N| \le K$ independent of the radius of $\Gamma$.
The latter relation means that $O(\bar N/2) = O(N)$.

In regard to the arc $\Gamma$ we shall demand further that it be kept uniformly
away from the poles of G and $\bar G$ when the radius is large.

We next define a set of constants which play an important rôle in the
asymptotic formulas for G, $\bar G$ and their derivatives. These are the n n-th roots
of $-1$, denoted by $\omega_1, \omega_2, \cdots, \omega_n$. For values of $\rho$ on any given sector S
let the subscripts be so chosen that


$R(\rho\omega_1) \le R(\rho\omega_2) \le \cdots \le R(\rho\omega_n)$, R = “the real part of.”
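Then, if the system is of odd order
$n = 2\mu - 1$,

$R(\rho\omega_\mu) = 0$ on the bisecting ray of S, so that in one half of S $R(\rho\omega_\mu) < 0$
whereas in the other half $R(\rho\omega_\mu) > 0$. Let these two halves be denoted by S'
and S'' respectively. Thus we have

$R(\rho\omega_1) \le \cdots \le R(\rho\omega_\mu) \le 0 \le R(\rho\omega_{\mu+1}) \le \cdots \le R(\rho\omega_n)$ on S',
$R(\rho\omega_1) \le \cdots \le R(\rho\omega_{\mu-1}) \le 0 \le R(\rho\omega_\mu) \le \cdots \le R(\rho\omega_n)$ on S''.

On the other hand, if the system is of even order
$n = 2\mu$,

$\omega_\mu = -\omega_{\mu+1}$ and on one of the bounding rays of S $R(\rho\omega_\mu) = R(\rho\omega_{\mu+1}) = 0$,
so that, throughout the whole of S,

$R(\rho\omega_1) \le \cdots \le R(\rho\omega_\mu) \le 0 \le R(\rho\omega_{\mu+1}) \le \cdots \le R(\rho\omega_n).$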
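The ordering of the roots of $-1$ described above can be illustrated numerically. The sketch below is an illustration only; the helper names `ordered_roots` and `count_nonpositive` are hypothetical, and the sample points are taken in the sector $0 \le \arg\rho \le \pi/n$. It sorts the n-th roots of $-1$ by the real part of $\rho\omega$ and counts how many of these real parts are negative or zero.

```python
import cmath
import math

def ordered_roots(n, rho):
    """The n-th roots of -1, ordered by increasing real part of rho * omega."""
    roots = [cmath.exp(1j * math.pi * (2 * k + 1) / n) for k in range(n)]
    return sorted(roots, key=lambda w: (rho * w).real)

def count_nonpositive(n, rho, tol=1e-12):
    """Number of roots omega for which the real part of rho * omega is <= 0."""
    return sum(1 for w in ordered_roots(n, rho) if (rho * w).real <= tol)

# Odd order n = 3 (mu = 2): the count changes across the bisecting ray arg(rho) = pi/6.
below_bisector = count_nonpositive(3, cmath.rect(10.0, math.pi / 12))
above_bisector = count_nonpositive(3, cmath.rect(10.0, math.pi / 4))

# Even order n = 4 (mu = 2): the count is mu throughout the open sector,
# and the two middle roots are negatives of each other.
even_count = count_nonpositive(4, cmath.rect(10.0, math.pi / 8))
even_roots = ordered_roots(4, cmath.rect(10.0, math.pi / 8))
```

For n = 3 the two halves of the sector give $\mu - 1$ and $\mu$ roots with non-positive real part, while for n = 4 the count is $\mu$ and `even_roots[1]` equals the negative of `even_roots[2]`, in agreement with the relation $\omega_\mu = -\omega_{\mu+1}$.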


These results enable us to state the conditions under which the exponential
functions $e^{\rho\omega_j(x-y)}$, $j = 1, 2, \cdots, n$, occurring in the asymptotic formulas for
G, $\bar G$, etc. are bounded in the form


THEOREM ASSOCIATED WITH GENERAL BOUNDARY VALUE PROBLEMS. 299 


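(6) $|e^{\rho\omega_j(x-y)}| \le 1$,


for all values of $\rho$ in question. This inequality holds whenever the real part
of $\rho\omega_j(x-y)$ is negative or zero, and hence the specific requirements to be
met are as follows:


Case 1. $n = 2\mu - 1$.

(6') $x \ge y$: $j = 1, \cdots, \mu$, $\rho$ on S'; $j = 1, \cdots, \mu - 1$, $\rho$ on S'';
     $x \le y$: $j = \mu + 1, \cdots, n$, $\rho$ on S'; $j = \mu, \cdots, n$, $\rho$ on S''.

Case 2. $n = 2\mu$.

(6'') $x \ge y$: $j = 1, \cdots, \mu$; $x \le y$: $j = \mu + 1, \cdots, n$; $\rho$ on S.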


In the explicit formulas for G, $\bar G$, etc., which will be used presently, it will be
seen that these conditions are satisfied in every instance, so that the exponentials
occurring in these formulas are bounded in the manner of (6).

We now observe a number of lemmas, the essential parts of the proofs
of which are based on results found in (S). The notation {A; B} is used to
indicate that A is to be taken if $x \ge y$, and B if $x \le y$. The letter R is used
to denote the radius of the circular arc $\Gamma$, so that for values of $\rho$ on $\Gamma$, $|\rho| = R$.
In outlining the proofs it is necessary to treat separately the cases $n = 2\mu - 1$
and $n = 2\mu$, inasmuch as the asymptotic formulas concerned are different in
these two cases. It is sufficient, however, to treat only one of the two equal
sectors S which make up $\Sigma$. The part of $\Gamma$ belonging to S will be denoted by $\gamma$,
and the two halves of it corresponding to S', S'' by $\gamma'$, $\gamma''$ respectively.


Lemma 1. For values of $\rho$ on $\Gamma$

$n\rho^{n-1}\,\partial G/\partial x = O(R)$

uniformly on $0 \le x, y \le 1$.


Case 1. $n = 2\mu - 1$. For values of $\rho$ on $\gamma'$ the following asymptotic
formula for the expression in question is given in (S, p. 745, with k = 1):


0G) 
n-1 
(7) mp 0x 


A, 


9G, \ 




= — > (Ay, (x) + Bi(y) + pos), 
j=1 


Py 3S (Ay (2) + Bi(y) + pos), 


([40] + = O(1),™ 


and A,” is a determinant of order n + 1, which, if expanded according to 
the elements of the first row, has the form 


n 
A, =2 (png) 5 (2, p) + (prj) (x, y, p)- 

The functions $m_j(x, y, \rho)$, $A_j(x)$, $B_j(y)$, $Q_j(x, y, \rho)$ are bounded in their
respective variables on $0 \le x, y \le 1$ when R is large. A similar formula holds
when $\rho$ is on $\gamma''$, the only difference being that the summations are extended
over the ranges $(1, \mu - 1)$ and $(\mu, n)$.

On examining the individual terms in these expressions it is seen that
conditions (6') are satisfied, so that the exponentials are all bounded in the
manner of (6). Hence the terms are either O(1) or O(R) for values of $\rho$
on $\gamma'$ and $\gamma''$; that is, $n\rho^{n-1}\,\partial G/\partial x = O(R)$ on $\gamma$.

Case 2. $n = 2\mu$. The asymptotic formula for $n\rho^{n-1}\,\partial G/\partial x$ when $\rho$
is on $\gamma$ is similar in form to the one used in Case 1, $\rho$ on S'; an explicit
expression for it is given in (S, p. 760, with k = 1). The exponentials involved
satisfy conditions (6'') and so are bounded as in (6), and the steps
of the proof go through exactly as in Case 1.
LEMMA 2. $\int_\Gamma n\rho^{n-1}[G(x, y; \rho^n) - \bar G(x, y; \rho^n)]\, d\rho = O(1)$ uniformly on
$0 < \delta \le x \le 1 - \delta$, $0 \le y \le 1$.

Case 1. $n = 2\mu - 1$. This is (S, Theorem VII, p. 716, with k = 1).


Case 2. $n = 2\mu$. The proof in this case is analogous to that of the
theorem cited above. In brief outline it is as follows:
For $\rho$ on $\gamma$ we have, by (S, p. 755),


14 The arc $\Gamma$ must be kept uniformly away from the poles of G when R is large,
so as to make this fraction bounded for large values of R. In view of the manner of
distribution of the characteristic values it is clear that this can always be done.


where 


n 
np™"G (2x, y3 p") = {— 
j=1 


A3 


The arc $\gamma$ being kept uniformly away from the poles of G when R is large,


the denominator of the second term is bounded away from zero. The numerator 
of the second term is a determinant of order n + 1, which, if expanded ac- 
cording to the elements of the first row, has the form of a linear combination 
of the functions (j=1,---,p), with 
coefficients which are bounded functions of 2, y, p. The second term above 
may thus be written in the form 


9, p) + (2, 9, 9); 
j=1 

in which the functions $M_j(x, y, \rho)$ are uniformly bounded on $0 \le x, y \le 1$ for
large values of R.

In the special case of system (1) which defines $\bar G$, a corresponding formula
will represent $n\rho^{n-1}\bar G$ on $\gamma$. It will be identical with the one given above except
for different terms of higher order in the asymptotic forms and different
functions $\bar M_j(x, y, \rho)$. Hence on subtracting these results we obtain


Np (¢G—G)= — — ;+ ees (a-y) 
Pp 


jal p 


n 
+ — + — Bj). 
j=l j 


j=prl 
The exponentials in { } are bounded as in (6). Hence $\int_\gamma n\rho^{n-1}(G - \bar G)\, d\rho$
is expressible in terms of integrals of the form


md 
p f (j 1, f md p (j i, n), 


which, according to (S, Lemmas III, IV', V', p. 714 and pp. 754-755, with
k = 1), are uniformly bounded on $0 < \delta \le x \le 1 - \delta$.
4 0G 0G AG) 


15 The notation [W] indicates an asymptotic form in $\rho$ in which W is the leading
term.




Case 1. $n = 2\mu - 1$. For $\rho$ on $\gamma'$ we use again formula (7) of Lemma 1,
writing E,(2,y,p) for Q;(2, y, p)/([60] + #[6,]). 
(0G 0G) { mj mj; | 


A similar form holds for $\bar G$, with $F_\mu^0$, $F_\mu^1$, $m_j$, $E_j$ replaced by $\bar F_\mu^0$, $\bar F_\mu^1$,
$\bar m_j$, $\bar E_j$ respectively. Hence on forming the difference of these two formulas
we obtain
{ 0G 0G | 
(Fi,° — F,,°) (a-y) (ms; — m3) mj) 
j=1 
(Put + > eps (a-y) (ms — 


+3 (pw; ) — +E (ows) (By — By). 


But 
~ (As, + Bi(y) + pos), 
=1 


whereas, according to (S, p. 746, with s —1), 
= — (pw;). 
=1 
Hence, by (6), 
— =— ~ + Bi(y)) = O(1). 
=1 


Likewise $F_\mu^1 - \bar F_\mu^1 = O(1)$. The remaining terms of { } are similarly
bounded (approaching zero as $R \to \infty$). Hence the integral of the expression
in { } taken over the arc $\gamma'$ will be of the order R. Moreover the integrals


(EB; — B;)dp 

7’ 

f — B;) dp 


converge to zero uniformly on $0 < \delta \le x \le 1 - \delta$ as $R \to \infty$, in accordance
with (S, Lemma IV, p. 714, with k = 1). Hence


iG 0G 0G 




In a like manner we find that the integral of this expression over $\gamma''$ is of
the form


” 


O(R) + oy perme (Hy — dp. 
7 


It remains now for us to determine the order of the second terms in the 
two expressions written above. The integrals involved may be put in the form 


y’ 4” 


where m’ = p(Ey— Ey) /R =O(1) ony’, and m” = — = O(1) 
on $\gamma''$. An application of (S, Lemma V, p. 714, with k = 1) will then
show that the multipliers of R are uniformly bounded on $0 < \delta \le x \le 1 - \delta$,
and hence we conclude that the second terms also are of order R in this interval.


Case 2. $n = 2\mu$. The argument in this case, based on appropriate
formulas found in (S), is entirely similar to the one just given.


Lemma 4. The number of poles of $G(x, y; \rho^n)$ on the sector $\Sigma$ enclosed
by the arc $\Gamma$ is given asymptotically by
(each pole being counted according to its multiplicity). 


This is immediately evident from formulas (2) which give the distribution
of the characteristic values $\lambda = \rho^n$. From this lemma it follows that


O(R) = O(N). 


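The proofs of Theorems 1 and 2. On differentiating with respect to x
in formula (4) we get

$S'_N(x) = \frac{1}{2\pi i}\int_0^1 S_N(y)\Big[\int_\Gamma n\rho^{n-1}\,\frac{\partial G}{\partial x}\, d\rho\Big]\, dy.$

But $|S_N(y)| \le 1$ on $0 \le y \le 1$. Hence, by Lemmas 1 and 4,

$S'_N(x) = \frac{1}{2\pi i}\int_0^1 O(1)\Big[\int_\Gamma O(R)\, d\rho\Big]\, dy = O(R^2) = O(N^2)$

on $0 \le x \le 1$. This proves Theorem 1.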


Next let us consider the trigonometric sum $T_N(x)$ defined by (5). On
subtracting it from $S_N(x)$ we have, by reason of Lemma 2,


L 
eroutin’ dp, R dp, 




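$S_N(x) - T_N(x) = \frac{1}{2\pi i}\int_0^1 S_N(y)\Big[\int_\Gamma n\rho^{n-1}(G - \bar G)\, d\rho\Big]\, dy$

$= \frac{1}{2\pi i}\int_0^1 S_N(y)\, O(1)\, dy = O(1)$

on $0 < \delta \le x \le 1 - \delta$. But $|S_N(x)| \le 1$, and hence

$T_N(x) = O(1)$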


on the interval $0 < \delta \le x \le 1 - \delta$, which is interior to the period interval
(0, 1). It follows then, from a special form of Bernstein's theorem given by
D. Jackson,¹⁶ that

(8) $T'_N(x) = O(\bar N/2) = O(N)$


uniformly on this interior interval.
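Finally, on differentiating with respect to x in the formula for
$S_N(x) - T_N(x)$ we obtain, by the help of Lemmas 3 and 4,

$S'_N(x) - T'_N(x) = \frac{1}{2\pi i}\int_0^1 S_N(y)\Big[\int_\Gamma n\rho^{n-1}\Big(\frac{\partial G}{\partial x} - \frac{\partial \bar G}{\partial x}\Big)\, d\rho\Big]\, dy$

$= \frac{1}{2\pi i}\int_0^1 S_N(y)\, O(R)\, dy = O(R) = O(N)$

uniformly on $0 < \delta \le x \le 1 - \delta$. Hence, by (8),

$S'_N(x) = T'_N(x) + O(N) = O(N)$

uniformly on $0 < \delta \le x \le 1 - \delta$. This proves Theorem 2.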


Application to a problem of best approximation. Let f(x) be a given
function continuous on $a \le x \le b$, and let it be required to define for each
positive integral value of N a function of best approximation to f(x) of
the form

$S_N(x) = \sum_{k=1}^{N} a_k u_k(x),$

in which the u's are the characteristic solutions of system (1). This may be
done by adopting as a measure of approximation the integral

$\int_a^b |f(x) - S_N(x)|^r\, dx,$

where r is any given real constant > 0, and requiring that the coefficients of


16D. Jackson, Transactions of the American Mathematical Society, vol. 26 (1924), 
pp. 133-154; p. 145. 




$S_N(x)$ be chosen in such a way as to give a minimum value to this integral.
It is well known that such determinations of the coefficients can always be
made, and when r > 1 the result is unique.

The question of the convergence of $S_N(x)$ to f(x) as N becomes infinite
may be investigated by methods similar to those used by Jackson ¹⁷ in the
study of the corresponding problems relating to trigonometric sums and polynomials.
These methods involve in an essential way the use of Theorems 1
and 2, and lead to the following general theorem:


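THEOREM 3. If $\tau_N(x)$ be an arbitrary sum of the u's of order N and
$h_N$ be the maximum value of $|f(x) - \tau_N(x)|$ on $a \le x \le b$, then there will
exist positive constants $C_1$ and $C_2$ independent of N such that


(a) on $a \le x \le b$, $|f(x) - S_N(x)| \le C_1 N^{2/r} h_N$;


(b) on $a + \delta \le x \le b - \delta$, $|f(x) - S_N(x)| \le C_2 N^{1/r} h_N$.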


Thus the question of convergence is made to depend directly on the degree
of approximation represented by $h_N$, that is on the degree of approximation
to f(x) that is possible by sums of the form $\tau_N(x)$. In this connection we have
the theorems on the degree of convergence of Birkhoff's series given by Milne ¹⁸
which enable us to state explicit hypotheses under which the quantities $N^{2/r} h_N$
and $N^{1/r} h_N$ will converge to zero as N becomes infinite. The following theorem
is given as typical of what can be done in this direction:


THEOREM. In the case r > 1, if f(x) has a first derivative of limited
variation on $a \le x \le b$, and if f(x) vanishes at a and b, then $h_N = O(1/N)$
so that $S_N(x)$ converges uniformly to f(x) on the sub-interval $a + \delta \le x \le b - \delta$
as N becomes infinite.


Mount ALLISON UNIVERSITY, 
SACKVILLE, N.B., CANADA. 


17 D. Jackson, American Mathematical Society, Colloquium Publications, vol. 11,
pp. 92-101; see also D. Jackson, Bulletin of the American Mathematical Society, December,
1933, pp. 889-906.

18 W. E. Milne, loc. cit., pp. 154-156.


ABSTRACT COVARIANT VECTOR FIELDS IN A GENERAL 
ABSOLUTE CALCULUS.* 


By A. D. MICHAL.


Introduction. The elements of a general absolute differential calculus
based on a linear connection and the notion of a contravariant vector field
only have recently been considered by me.¹ In the present paper additional
postulates on the transformation of Banach “coördinates” are considered and
a linear connection of covariant type is postulated. A brief treatment is then
given of a general absolute differential calculus based on covariant vector fields
as well as on contravariant vector fields. The ideas centering around the
various adjoints of the Fréchet differentials ² are of fundamental importance
here as well as in the instances in which the Banach space is an infinitely
dimensional function space.


1. Abstract coördinate transformations. Let E be a Banach space in
which there exists a function [x, y] with the following properties: ³


(1) [x, y] is a bilinear function on $E^2$ to the real numbers;

(2) [x, y] = [y, x];

(3) [x, y] is positive definite; i.e., $[x, x] \ge 0$ and $[x, x] = 0$ if and only if
x = 0.


DEFINITION. A function $T^*(\xi)$ on E to E will be said to be the adjoint
of a linear function $T(\xi)$ on E to E if


(1) $T^*(\xi)$ is a linear function;
(2) $[T(\xi), \eta] = [\xi, T^*(\eta)]$.
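A finite-dimensional instance may make the definition concrete. The sketch below is an illustration under stated assumptions, not part of the paper: E is taken to be three-dimensional real space, the bracket [x, y] is taken to be the ordinary dot product, and the matrix `T` and the vectors `xi`, `eta` are arbitrary sample values. In this setting the adjoint of a linear function is given by the transposed matrix.

```python
def inner(x, y):
    """Euclidean inner product, standing in for the bracket [x, y]."""
    return sum(a * b for a, b in zip(x, y))

def apply(matrix, vector):
    """Linear function T(xi) given by a matrix acting on a vector."""
    return [inner(row, vector) for row in matrix]

def adjoint(matrix):
    """Transpose: the adjoint of T with respect to the Euclidean bracket."""
    return [list(column) for column in zip(*matrix)]

# Sample data (hypothetical values chosen for illustration).
T = [[2.0, -1.0, 0.5],
     [0.0, 3.0, 1.0],
     [4.0, 0.0, -2.0]]
xi = [1.0, 2.0, -1.0]
eta = [0.5, -3.0, 2.0]

lhs = inner(apply(T, xi), eta)            # [T(xi), eta]
rhs = inner(xi, apply(adjoint(T), eta))   # [xi, T*(eta)]
```

The two numbers `lhs` and `rhs` coincide, which is condition (2) of the definition for this choice of bracket.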


Let $U_0$ be a fixed Hausdorff neighborhood of a Hausdorff ⁴ space T. We


* Presented to the American Mathematical Society under a different title, April, 
1936. Received by the Editors February 16, 1937. 

1Michal (1). 

2? By the notation f(a, *sY,3%) we shall always mean the partial Fréchet 
differential of f(a, + in & with increment A. Occasionally we shall write 


@ 

for 6x”) and dy for the partial Fréchet differential of f(z,y) in 

evaluated at y = a. 


3 For motions and rotations in such spaces see Michal, Highberg and Taylor (1).
4 More generally one can take a Fréchet neighborhood space V and require the
coördinate systems to be merely reciprocal (1-1) transformations.


306 


(2) [z, y| Ly; | 


ABSTRACT COVARIANT VECTOR FIELDS. 307 


shall assume that there exists an open set $S \subset E$ that is a homeomorphic map
of $U_0$. We postulate the existence of coördinate systems x(P): homeomorphic
correspondences mapping Hausdorff neighborhoods onto open sets $\Sigma \subset S$.
Suppose x(P) and $\bar x(P)$ are coördinate systems for two intersecting Hausdorff
neighborhoods $U_1$ and $U_2$ respectively and let $\Sigma_1$ and $\Sigma_2$ be the respective maps
in S. Then the intersection of $U_1$ and $U_2$ induces a homeomorphic mapping,
called a coördinate transformation, of an open subset $S_1$ of $\Sigma_1$ onto an open
subset $S_2$ of $\Sigma_2$. We shall denote this coördinate transformation by $\bar x = \bar x(x)$.

It is convenient to call the Hausdorff neighborhood and the map $\Sigma \subset S$
the geometrical domain and the coördinate domain respectively of the coördinate
system. We shall assume that each coördinate transformation $\bar x(x)$ and
its inverse $x(\bar x)$ have Fréchet differentials $\bar x(x; \delta x)$ and $x(\bar x; \delta\bar x)$ throughout
the sub-coördinate domains $S_1$ and $S_2$ respectively of the coördinate systems
x(P) and $\bar x(P)$. We shall further assume that $\bar x(x; \delta x)$ possesses an adjoint
$\bar x^*(x; \delta x)$ and that $x(\bar x; \delta\bar x)$ has an adjoint $x^*(\bar x; \delta\bar x)$. It can be shown readily
that $\bar x(x; \delta x)$ is a solvable linear function of $\delta x$ with $x(\bar x; \delta\bar x)$ as inverse.
From the postulates for [x, y] and the following evident steps


$[\delta\bar x, \xi] = [\bar x(x; x(\bar x; \delta\bar x)), \xi] = [x(\bar x; \delta\bar x), \bar x^*(x; \xi)] = [\delta\bar x, x^*(\bar x; \bar x^*(x; \xi))]$


it follows that
(1.1) $x^*(\bar x; \bar x^*(x; \xi)) = \xi$ for all $\xi$.
Similarly
(1.2) $\bar x^*(x; x^*(\bar x; \xi)) = \xi$ for all $\xi$.


From these two results it follows that $\bar x^*(x; \delta x)$ is a solvable linear function
of $\delta x$ with $x^*(\bar x; \delta\bar x)$ as inverse.


2. Covariant differential of a covariant vector field. The absolute calculus
of contravariant vector fields has been studied elsewhere.⁵ The components
of a geometric object have a characteristic law of transformation in the intersection
of two Hausdorff neighborhoods. The law of transformation for a
contravariant vector field is
(2.1) $\bar\xi(\bar x) = \bar x(x; \xi(x)).$


DEFINITION 2.1. A covariant vector field is a geometric object whose
components transform in the intersection of two Hausdorff neighborhoods
according to the law
(2.2) $\bar\eta(\bar x) = x^*(\bar x; \eta(x))$


under a transformation of coördinates $\bar x(x)$.


5 Michal (1). 




308 A. D. MICHAL. 


The inverse of (2.2) is
(2.3) $\eta(x) = \bar x^*(x; \bar\eta(\bar x)).$


In addition to the restrictions of § 1 we shall now assume that the second
Fréchet differential $\bar x(x; \delta_1 x; \delta_2 x)$ exists continuous in x and that the Fréchet
differential


du_x*(o;) 
exists continuous in z. It can be shown with the aid of theorems proved 


elsewhere ® that z(%;8,Z; 6%) exists continuous in Z. It can also be shown 
that the adjoint 7*(Z; 3A) of the linear function x(Z;;A) of A exists con- 


tinuous in # and that the adjoint z*(Z, A, ») of the linear function i £*(o;A) 
of » exists continuous in Z. 


THEOREM 2.1. Under the restrictions on codrdinate transformations just 
described, the following relations are valid 


(2.4) de 3A) 
“ 


(2. 5) ds a*(o;A) =2*(4,A,p). 
“ 


The functions and are bilinear in d and and self 
adjoint as linear functions of wp. 


Proof. On differentiating 


(2. 6) = [v, 
we obtain 

(2.7) [v, dg a*(o3d)]. 
Clearly 


Hence from (2.7), (2.8) and the positive definiteness of the inner product 


there results (2.4). 
From (2.7) and the definition of $x^*(\bar x, \lambda, \nu)$ we have


(2.9) = [p, A, v) 


But $x(\bar x; \nu; \mu)$ is symmetric in $\nu$ and $\mu$ so that (2.9) makes clear that


6 Michal (2); Michal and Elconin (1).




$x^*(\bar x, \lambda, \nu)$ is self adjoint as a linear function of $\nu$. Hence (2.5) is valid.
Finally the bilinearity of and in A and follows from 


the continuity of dc xz*(o;X) in Z and a theorem of Banach on the linearity 


of the limit of a sequence of linear functions.⁷


THEOREM 2.2. A necessary and sufficient condition that $[\xi(x), \eta(x)]$ be
a scalar invariant for an arbitrary contravariant vector ⁸ $\xi(x)$ is that $\eta(x)$ be a
covariant vector.⁸
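Theorem 2.2 can be illustrated in two dimensions. The following sketch rests on assumptions that are not in the paper: E is the plane with the dot product as bracket, the differential of the coördinate transformation at a point is represented by a hypothetical invertible matrix `J`, and consequently the adjoint of the inverse differential is the transpose of the inverse of `J`. The contravariant law (2.1) then multiplies by `J`, while the covariant law (2.2) multiplies by the transposed inverse.

```python
def inner(x, y):
    """Euclidean inner product, standing in for the bracket [x, y]."""
    return sum(a * b for a, b in zip(x, y))

def apply(matrix, vector):
    """Matrix acting on a vector."""
    return [inner(row, vector) for row in matrix]

def transpose(matrix):
    return [list(column) for column in zip(*matrix)]

def inverse_2x2(matrix):
    (a, b), (c, d) = matrix
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

# Hypothetical differential of the coordinate transformation at one point.
J = [[2.0, 1.0],
     [0.0, 3.0]]
xi = [1.0, -2.0]    # contravariant components (sample values)
eta = [0.5, 4.0]    # covariant components (sample values)

xi_bar = apply(J, xi)                               # law (2.1)
eta_bar = apply(transpose(inverse_2x2(J)), eta)     # law (2.2)

invariant_before = inner(xi, eta)
invariant_after = inner(xi_bar, eta_bar)

# Transforming eta by the contravariant law instead breaks the invariance.
wrong_after = inner(xi_bar, apply(J, eta))
```

Here `invariant_before` and `invariant_after` agree, whereas `wrong_after` does not, which is the sufficiency and necessity asserted by the theorem in this special setting.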


DEFINITION 2.2. A covariant linear connection is a geometric object
with components $L(x, \eta(x), \xi(x))$ that are bilinear functions of a covariant
vector $\eta(x)$ and a contravariant vector $\xi(x)$ and such that in the intersection
of the geometrical domains of two coördinate systems x(P) and $\bar x(P)$, the
components have the law of transformation


(2.10) L(%,7(Z), €(£)) —a* (4; L(x, n(x), €(x)) + 5 
under the transformation of coördinates $\bar x = \bar x(x)$.


Let the covariant vector $\eta(x)$ have a continuous differential $\eta(x; \delta x)$.
Then from known theorems on Fréchet differentials and from Theorem 2.1
we obtain


(2. 11) 82) = (£3 dx) ) + (£3 
Hence with the aid of (2.10) we obtain 
(2.12) 4(@; 8%) 8%) = n(x; 8x) — L(x, 4 (2), dx) 
The steps are reversible so that we have proved the
THEOREM 2.3. A necessary and sufficient condition that
(2.13) $\eta(x; \delta x) - L(x, \eta(x), \delta x)$


be a covariant vector whenever $\eta(x)$ is any continuously differentiable covariant
vector is that $L(x, \eta(x), \delta x)$ be a covariant linear connection.


7 Michal (2).

8 We use contravariant vector and covariant vector as abbreviations for contravariant
vector field and covariant vector field respectively. One can, however, define a
covariant vector (strict) and contravariant vector (differential) as usual and recast
the definitions and theorems (except Theorem 2.3) in the obvious way by substituting
contravariant (covariant) vectors for contravariant (covariant) vector fields in the
arguments of the linear connections and multilinear forms.




DEFINITION 2.3. If $L(x, \eta(x), \delta x)$ is a covariant linear connection, then
the linear form $\eta(x/\delta x)$ in $\delta x$ defined by
(2.14) $\eta(x/\delta x) = \eta(x; \delta x) - L(x, \eta(x), \delta x)$
will be called the covariant differential (based on L) of the covariant vector $\eta(x)$.

Let $\Gamma(x, \xi_1(x), \xi_2(x))$ (not necessarily symmetric) be a linear connection,⁹
where $\xi_1(x)$, $\xi_2(x)$ are contravariant vectors. This is to be distinguished from
the covariant linear connection of Theorem 2.3. The law of transformation ¹⁰
for a linear connection is


(2.15) I'(Z, &(2) ) = (2,8 (x), &(2))) + (4) &(Z))), 


An equivalent law of transformation to (2.15) is 
(2.16) &(@)) = (2, &(x))) & (2) 
We shall use these laws of transformation in the next section. 

3. Covariant differential of multilinear forms in covariant and contravariant
vector fields. Since the covariant differential of a covariant vector is
a covariant vector depending linearly on an arbitrary contravariant vector, it is
clear that the theory of successive covariant differentials can be brought under
the theory of covariant differentials of multilinear forms. We shall prove

THEOREM 3.1. If 

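(i) $F(x, \xi_1(x), \cdots, \xi_r(x), \eta_1(x), \cdots, \eta_s(x))$ is a covariant vector
valued multilinear form in the continuously differentiable contravariant vectors
$\xi_1(x), \cdots, \xi_r(x)$ and covariant vectors $\eta_1(x), \cdots, \eta_s(x)$,


(ii) the partial differential $F(x, \xi_1, \cdots, \xi_r, \eta_1, \cdots, \eta_s; \delta x)$ exists continuous
in x,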


then the function +, &(%),m(2),° ) defined by 
—= F(a, 5 82°) 


L (a, ni(x), 8), nis 98 (7) ) 


9 Michal (1), (2).
10 Michal (1).




is a covariant vector valued multilinear form in $\xi_1, \cdots, \xi_r, \eta_1, \cdots, \eta_s, \delta x$.
We shall call $F(x, \xi_1, \cdots, \xi_r, \eta_1, \cdots, \eta_s/\delta x)$ the covariant differential of
$F(x, \xi_1, \cdots, \xi_r, \eta_1, \cdots, \eta_s)$.


Proof. We shall give the details of proof for r = 1, s = 1 as the proof
for the general case, although lengthy, differs in no essential manner from that
of this special case.

By hypothesis


(3.2) = 0* (4; F(a, &(2), 9(2))) 
from which we obtain with the aid of Theorem 2. 1 


= a* (2; F(x, E(x), ; 8x) ) 
(3.3) + (€; 8%; F(x, E(x), 4(x))) 


+ (4; dg, P(x, n(0))) —dg, F(z, 8(0), 5(2)). 
On using (2.16) and (3.2) we find that 


| F(a, T(z, &(2), 82), 9(2))) 
FES, €(x) ; 8x), H(z ‘)). 


Similarly from (2.10) and (3. 2) 


fF &(Z), L(#,7(Z), br) ) = a* (4; E(x), L(x, n(2), éx))) 
F(a, (2; 885 »(2))). 
Evidently 


(3 6) J (x; L (2, F(a, (2), ), 8x) ) 
+ 2* (&; 8; F(a, €(2),9(2))). 


Taking the differential of (2.1) and using (3.2) we obtain 


(3.7) F(Z, ) = F (4, ; 8x), 9(Z) ) 
+ 2* (2; F(z, 


Similarly from (2.11) and (3. 2) 


F(a, €(£), & ‘)) = FG, E(Z) x* q(x) ) ) 
{ + 2* (2; é(z), 


Reducing (3.3) by means of (3.7) and (3.8) 




— F(x, (x), n(x) ; 8c) ) + (4; 84; F(a, 
— F(z, E(x 3€(2) ; bx), —F(4, &(Z), 2*(Z; 


Finally with the aid of (3.4), (3.5), (3.6), and (3.9) we obtain 
(4, = (£; F(a, &(x), 4(x)/8z) ), 


which completes the proof of the special case r= 1, s = 1. 
The following two theorems can now be proved without much difficulty. 


THEOREM 3.2. If in the hypotheses of Theorem 3.1, the function
$F(x, \xi_1, \cdots, \xi_r, \eta_1, \cdots, \eta_s)$ is taken to be a contravariant vector valued multilinear
form, then the function $F(x, \xi_1, \cdots, \xi_r, \eta_1, \cdots, \eta_s/\delta x)$ defined by


j F(a, , (x), ',&(z), m(Z),° s(x) /8x) 
— F(z, :(Z),°° » s(x) ; 8x) 


8 
+ F(2, &1, » Nis * 5 Ni-1, L (a, Nis 5x), Misty" 8) 
i=1 


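is a contravariant vector valued multilinear form in $\xi_1, \cdots, \xi_r, \eta_1, \cdots, \eta_s, \delta x$.
We shall call $F(x, \xi_1, \cdots, \xi_r, \eta_1, \cdots, \eta_s/\delta x)$ the covariant differential of
$F(x, \xi_1, \cdots, \xi_r, \eta_1, \cdots, \eta_s)$.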


THEOREM 3.3. If in the hypotheses of Theorem 3.1, the function
$F(x, \xi_1, \cdots, \xi_r, \eta_1, \cdots, \eta_s)$ is taken to be an absolute scalar multilinear form
(with numerical values or with values in a Banach space), then the function


= m1, °°* Ne 82) 


i=1 


i=1 


is an absolute scalar multilinear form (with numerical or Banach values) in 
covariant differential of F(x, &,° 715° 


P (a, &(@), 82) 


To have the successive covariant differential $\eta(x/\delta_1 x/\cdots/\delta_p x)$ well
defined in all coördinate systems, it is sufficient to assume that (1) the covariant
vector $\eta(x)$ possesses a continuous p-th differential; (2) $L(x, \eta, \xi)$ has a
continuous p-th partial differential in the first place; (3) $\Gamma(x, \xi_1, \xi_2)$ has a
continuous (p - 1)-st partial differential in the first place; (4) $\bar x(x)$ has a
continuous (p + 2)-nd differential; and (5) $x^*(\bar x; \lambda)$ has a continuous
(p + 1)-st differential in $\bar x$.

By calculation we obtain the commutation rule 


(3.12) — 
= — L (2, 8:2, 8.4) — 2y(x/Q (2, 8:2, 82.7) ), 
where 
(3. 13) n(x), 8.0) = n(x), 8:03; 8.4) — L (a, n(x), 812) 
L(2, L(x, n(x), 822), 8:0) L (2, 8:2), 520) 
and 
(3. 14) OQ (2, 8:2, 8.7) = (2, 8,2, 8.4) 820, 5:2) }. 


Since Q(z, 5,2, 6.7) is the contravariant vector valued torsion form, it follows 
from (3.12) and Theorem 3.1 that the trilinear form L (2, n(x), 8.2) is a 
covariant vector valued trilinear form, called the (covariant vector valued) 
curvature form. 

Suppose now that $F(x, \eta(x), \delta x)$ is bilinear in covariant vectors $\eta(x)$
and in $\delta x$ and suppose further that $F(x, \eta(x), \delta x)$ is the component of a
geometric object. On making special use of the properties of the adjoints and
the law of transformation of the linear connection one can demonstrate without
much difficulty the following theorem.


THEOREM 3.4. A necessary and sufficient condition that the adjointness
relation


$[\Gamma(x, \xi(x), \delta x), \eta(x)] = [\xi(x), F(x, \eta(x), \delta x)]$


be a geometric condition (i.e., continues to hold under a transformation of
coördinates) is that $F(x, \eta(x), \delta x)$ be a covariant linear connection.


The importance of a relation of the above type between the linear connection
$\Gamma$ and the covariant linear connection L is made clear from the following
result.


THEOREM 3.5. A necessary and sufficient condition that


$\delta[\xi(x), \eta(x)] = [\xi(x/\delta x), \eta(x)] + [\xi(x), \eta(x/\delta x)]$




for all continuously differentiable contravariant vectors $\xi(x)$ and covariant
vectors $\eta(x)$ is that the covariant linear connection $L(x, \eta(x), \delta x)$ be the
adjoint of the linear connection $\Gamma(x, \xi(x), \delta x)$ considered as a linear function
of $\xi(x)$.
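The mechanism of Theorem 3.5 can be seen in a small numerical sketch. Several things here are assumptions rather than statements of the paper: E is the plane with the dot product as bracket; the linear connection is given by a hypothetical array of constant coefficients `G`; the covariant differential of a contravariant vector is taken with the sign convention $\xi(x/\delta x) = \xi(x; \delta x) + \Gamma(x, \xi, \delta x)$, which belongs to the earlier papers cited and is not written out in this text; and the ordinary differentials `d_xi`, `d_eta` are arbitrary sample values. The covariant connection is built as the adjoint of $\Gamma$ in its contravariant argument, and the covariant differential of $\eta$ follows (2.14).

```python
def inner(x, y):
    """Euclidean inner product, standing in for the bracket [x, y]."""
    return sum(a * b for a, b in zip(x, y))

def gamma(G, xi, dx):
    """Linear connection: component i is sum over j, k of G[i][j][k]*xi[j]*dx[k]."""
    n = len(xi)
    return [sum(G[i][j][k] * xi[j] * dx[k] for j in range(n) for k in range(n))
            for i in range(n)]

def adjoint_connection(G, eta, dx):
    """Adjoint of gamma in its xi argument:
    component j is sum over i, k of G[i][j][k]*eta[i]*dx[k]."""
    n = len(eta)
    return [sum(G[i][j][k] * eta[i] * dx[k] for i in range(n) for k in range(n))
            for j in range(n)]

# Hypothetical sample data.
G = [[[1.0, 2.0], [0.0, -1.0]],
     [[3.0, 0.0], [2.0, 1.0]]]
xi, eta = [1.0, -2.0], [0.5, 4.0]
dx = [2.0, 1.0]
d_xi, d_eta = [0.3, -0.1], [-1.0, 0.2]     # ordinary differentials along dx

# Ordinary differential of the bracket [xi, eta].
ordinary = inner(d_xi, eta) + inner(xi, d_eta)

# Covariant differentials (sign convention for xi is an assumption, see above).
cov_xi = [a + b for a, b in zip(d_xi, gamma(G, xi, dx))]
cov_eta = [a - b for a, b in zip(d_eta, adjoint_connection(G, eta, dx))]
covariant = inner(cov_xi, eta) + inner(xi, cov_eta)
```

With L chosen as the adjoint of $\Gamma$ the connection terms cancel, so `ordinary` and `covariant` agree; replacing `adjoint_connection` by any other bilinear function would in general destroy this equality.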


CALIFORNIA INSTITUTE OF TECHNOLOGY, 
PASADENA, CALIFORNIA. 


REFERENCES. 


A. D. Michal, I. E. Highberg and A. E. Taylor: 
(1) “ Abstract Euclidean spaces with independently postulated analytical 
and geometrical metrics,” Annali di Pisa (in press). 
A. D. Michal: 
(1) “General tensor analysis,” Bulletin of the American Mathematical
Society (in press).
(2) “ Postulates for a linear connection,” Annali di Matematica (in press). 
A. D. Michal and V. Elconin: 
(1) “Completely integrable differential equations in abstract spaces,” Acta 
Mathematica (in press). 




A TYPE OF HOMOGENEITY FOR CONTINUOUS CURVES.¹


By CHARLES H. WHEELER, III.


Introduction. A set of points M is said to be homogeneous ² if, given any
two points x and y of M, it is possible to find a homeomorphism which will
send M into itself in such a way that x is sent into y.

A set of points M is said to be bi-homogeneous ³ if given any two points
x and y of M there exists a homeomorphism which sends M into itself in such
a way that x is sent into y and y is sent into x.

We will investigate the conditions under which a compact, locally connected
continuum may be cyclic element homogeneous, i.e., given any two
true cyclic elements ⁴ of a set M, there exists a homeomorphism which sends
M into itself in such a way that one of the given true cyclic elements is sent
into the other. A compact locally connected continuum is a continuous curve
in the sense that it is a set of points which is the image of the unit interval
under a continuous transformation.

The case where M contains only a finite number of true cyclic elements 
will be completely treated, while some results will be stated for the case where 
M contains infinitely many true cyclic elements. 

Two simple closed curves joined by a simple arc provide an example of a
cyclic element homogeneous set. The curve illustrated in Fig. 1 is not cyclic
element homogeneous because the true cyclic element marked $C_1$ cannot be
sent into any of the other true cyclic elements by a homeomorphism which
sends the set into itself. In Fig. 2, also in Fig. 1, each true cyclic element is
homeomorphic with each of the remaining ones, but in Fig. 2 the cyclic element
marked $C_1$ cannot be sent into any of the others by a homeomorphism
which sends the set into itself. Fig. 3 is an example of a set of points which
contains an infinite number of true cyclic elements and is cyclic element
homogeneous.


1 Received April 18, 1935; revised October 15, 1936.

2 See Kuratowski, Fundamenta Mathematicae, T. 3 (1922), pp. 14-19, also
Mazurkiewicz, Fundamenta Mathematicae, T. 5 (1924), pp. 137-146.

3 Kuratowski, loc. cit.

4 The cyclic elements of a locally connected continuum are (1) all cut points of
the continuum and (2) the set of all points conjugate to a point p, where p is any
non-cut point of the continuum. A true cyclic element is one which does not reduce
to a single point. See G. T. Whyburn, “Concerning the structure of a continuous
curve,” American Journal of Mathematics, vol. 50 (1928), pp. 167-194, and Kuratowski
and Whyburn, “Sur les éléments cycliques et leurs applications,” Fundamenta Mathematicae,
T. 16 (1930), pp. 305-331.

315 


316 CHARLES H. WHEELER, III. 


1. Preliminary theorems. Let M be any compact and locally connected
continuum, which we shall consider as a space. Designate by H the smallest
A-set ⁵ which contains all the true cyclic elements $C_i$ in M, i.e., an A-set
containing all the true cyclic elements $C_i$ and not a proper subset of any A-set

[Figs. 1, 2 and 3.]


containing all the true cyclic elements. The set H may be obtained by taking
the product of all A-sets in M which contain all the true cyclic elements. Then
since ⁶ the product of any number of A-sets is an A-set, it follows that H is an


5 A closed set A which has the property that if x, y ∈ A then every arc xy in the
space is contained in A.
6 Cf. Kuratowski and Whyburn, loc. cit., Theorem 4.1.




A TYPE OF HOMOLOGY FOR CONTINUOUS CURVES. 317 


A-set and clearly it is the smallest such set containing all the true cyclic
elements. If there are only a finite number of true cyclic elements, the set H
may be obtained by choosing a non-cut point $p_i$ from each true cyclic element
$C_i$, and forming all the possible pairs of these points. Then for each pair
$p_i$, $p_j$ form the cyclic chain $C_{ij}(p_i, p_j)$. We then have $H = \sum_{i,j} C_{ij}(p_i, p_j)$.


1.1. THEOREM. If M contains only a finite number (> 1) of true cyclic
elements, then there exist at least two true cyclic elements each of which has
only one point in common with the closure of the remainder of H.


Proof. This follows immediately from a theorem of G. T. Whyburn.⁷
He proves that if a continuum has more than one cyclic element then it contains
at least two nodes, where a node is defined as an end point or a true
cyclic element which contains only one cut point. Hence H must contain at
least two true cyclic elements which contain only one cut point since it contains
no end points.


1.2. THEOREM. If T is any homeomorphism which sends M into itself,
then T(H) = H.


Proof. The set H is uniquely defined as the smallest A-set containing
all the true cyclic elements of M. Any homeomorphism T will send H into
an A-set which contains all the true cyclic elements of M, thus $H \subset T(H)$.

Since $T^{-1}$ is a homeomorphism, $H \subset T^{-1}(H)$. Operating upon this with
T we have $T(H) \subset T T^{-1}(H) = H$. Thus $T(H) \subset H$. It follows from the
above two inclusions that T(H) = H, and the theorem is proved.

We will consider H to be our space for the remainder of this paper. H is 
a compact locally connected continuum. 


1.3. Definition. A set H is said to be cyclic element homogeneous if,
given any two true cyclic elements of H, there exists a homeomorphism T
such that T(H) = H and one of the given true cyclic elements is sent into
the other.

From this definition we have immediately the following: 


1.4. THEOREM. If H is cyclic element homogeneous, then each true
cyclic element contains the same number (finite or infinite) of cut points of H.


2. The case where H contains only a finite number of true cyclic elements.
If the set H contains only a finite number (> 1) of true cyclic elements, 


*“ Concerning the structure of a continuous curve,” loc. cit., p. 180. 


318 CHARLES H. WHEELER, III. 


there exists a true cyclic element $C_1$ which contains only one point which cuts
H, by 1.1. Then every true cyclic element of H has only one point in common
with the closure of the remainder of H, by 1.3, i.e., $C_i \cdot \overline{H - C_i}$ = a single
point.


Definition. Let $p_i^j$ be the point of $C_i^j$ such that $p_i^j \in \overline{H - C_i^j}$. The sum
of all true cyclic elements which contain $p_i^j$ is called a cluster of true cyclic
elements. The point $p_i^j = p_j$ is called the center of the cluster $K_j$.


Then $K_j = \sum_i C_i^j$ and $\prod_i C_i^j = p_j$. The centers of every two clusters
are joined by a simple arc in H, since H is an A-set. The arc $p_j p_k$ is such that

$p_j p_k \cdot K_j = p_j$ and $p_j p_k \cdot K_k = p_k$,

for we have seen above that no $C_i^j$ of $K_j$ can have more than one point in
common with $\overline{H - C_i^j}$.


2.1. THEOREM. If H is cyclic element homogeneous and contains only a
finite number of true cyclic elements, then there is the same number of true
cyclic elements in each cluster.


Proof. Suppose the contrary; then there exist some two clusters such that

$K_1 = \sum_{i=1}^{n} C_i^1$, $K_2 = \sum_{i=1}^{m} C_i^2$, $n > m$.

The points $p_1$ and $p_2$ are the centers of the clusters $K_1$ and $K_2$ respectively.
Since H is cyclic element homogeneous, let T(H) = H in such a way that
$T(C_1^1) = C_1^2$. Now $C_1^1 \cdot \overline{K_1 - C_1^1} = p_1$ and $C_1^2 \cdot \overline{K_2 - C_1^2} = p_2$, and hence
$T(p_1) = p_2$.

Since $C_2^1 \supset p_1$, then $T(C_2^1)$ = some $C_i^2$ for some i, i = 2, 3, ..., m.
This is also true for the remaining n - 2 true cyclic elements in $K_1$, but there
are n - m fewer true cyclic elements in $K_2$ than in $K_1$, so this is impossible.
Thus the supposition that the clusters do not contain the same number of true
cyclic elements leads to a contradiction, and the theorem is proved.


2.2. COROLLARY. If a true cyclic element of one cluster is transformed
by a homeomorphism T into a true cyclic element of another cluster, then the 
first cluster is transformed into the second cluster by T. 


2.3. THEOREM. If H is cyclic element homogeneous and contains only
a finite number of true cyclic elements, then no cluster cuts H. 




Proof. Suppose there exists no cluster which does not cut H. There are
only a finite number of clusters since there are only a finite number of true
cyclic elements in H. Then $H - K_1 = H_1^1 + H_2^1$, where

$\overline{H_1^1} \cdot H_2^1 = H_1^1 \cdot \overline{H_2^1} = 0$ and $H_1^1 \ne 0 \ne H_2^1$.

$H_1^1$ contains at least one cluster $K_i$. Let $K_{n_1}$ be the cluster in $H_1^1$ with
the least subscript. Then $H - K_{n_1} = H_1^2 + H_2^2$, where

$\overline{H_1^2} \cdot H_2^2 = H_1^2 \cdot \overline{H_2^2} = 0$, $H_1^2 \ne 0 \ne H_2^2$ and $H_2^2 \supset K_1$.

$H_1^2$ contains at least one cluster $K_i$, $K_i \ne K_{n_1}$. Let $K_{n_2}$ be the
cluster in $H_1^2$ with the least subscript. Then $H - K_{n_2} = H_1^3 + H_2^3$, where

$\overline{H_1^3} \cdot H_2^3 = H_1^3 \cdot \overline{H_2^3} = 0$, $H_1^3 \ne 0 \ne H_2^3$ and $H_2^3 \supset K_{n_1}$.

$H_1^3$ contains at least one cluster $K_i$, $K_i \ne K_{n_2}$.

This can be carried on indefinitely, but this is impossible for there are
only a finite number of clusters. Thus there exists at least one cluster $K_t$
which does not cut H.

Let $K_s$ be any cluster of H different from $K_t$. By 2.2, $H - K_t$ is homeomorphic
with $H - K_s$ and hence $K_s$ cannot cut H. Thus no cluster cuts H.


2.4. Definitions. 


2.41. A compact locally connected continuum H is said to be symmetrical
with respect to a cyclic element C (whether C be a true cyclic element or not)
if every component of H - C is homeomorphic with every other component
of H - C. We will call C the center of symmetry.


2.42. A set of points H is said to be cyclic symmetrical with respect to a
cyclic element C (whether C be a true cyclic element or not) if it is symmetrical
and any true cyclic element of one component of H - C may be
sent into any true cyclic element of another component of H - C by a
homeomorphism which sends the first component into the second.


2.43. A major branch at a branch point x is the component of H - x
which contains the center of symmetry.


2.44. A minor branch at a branch point x is a component of H - x
which does not contain the center of symmetry.


2.5. THEOREM. If H is cyclic element homogeneous and contains only
a finite number of true cyclic elements, then there exists a point c such that
H is cyclic symmetrical with respect to c.




Proof. We saw that the centers of every pair of clusters are joined by an
arc in H, and that this arc has only the centers of the clusters in common with
the clusters. Also by 2.3 no cluster cuts H. Thus $H - \sum_i K_i$ is an acyclic
curve with a finite number of branches.

There is a center $p_j$ of a cluster $K_j$ at the end of each branch. Form all
the possible pairs of the points $p_j$. Take a pair $p_1$, $p_2$ which has a maximum
number of branch points on the arc joining them. If this number is odd let
the middle branch point be c; if it is even let c be any point on the open arc
joining the two middle branch points. The point c is uniquely determined
in the case where the maximum number is odd, and uniquely determined to
the extent of being any one of the points on an open arc in the case where the
maximum number is even. Now

$H - c = S_1 + S_2 + \cdots + S_n$,

where $S_1, S_2, \dots, S_n$ are components. If the number of branch points on the
arc joining $p_1$ and $p_2$ is even there are only two components, while if this
number is odd there may be any finite number of components. Let $S_1 \supset p_1$
and $S_2 \supset p_2$.
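The choice of c can be illustrated on a purely combinatorial model. The sketch below is not part of the paper: it assumes the acyclic curve has been reduced to a finite tree given as an adjacency dictionary, in which end vertices stand for the centers $p_j$ and vertices of degree at least 3 stand for branch points. The names `path` and `center_of_symmetry` are hypothetical.

```python
from itertools import combinations


def path(tree, a, b):
    """Return the unique path from a to b in a tree (adjacency dict)."""
    stack = [(a, [a])]
    seen = {a}
    while stack:
        node, trail = stack.pop()
        if node == b:
            return trail
        for nxt in tree[node]:
            if nxt not in seen:
                seen.add(nxt)
                stack.append((nxt, trail + [nxt]))
    raise ValueError("vertices are not connected")


def center_of_symmetry(tree):
    """Pick a pair of end vertices with a maximum number of branch points
    between them.  Return the middle branch point as a 1-tuple (odd case)
    or the two middle branch points (even case; c may then be any point
    of the open arc joining them)."""
    ends = [v for v in tree if len(tree[v]) == 1]

    def branch_points(p, q):
        # Branch points are modelled as vertices of degree >= 3.
        return [v for v in path(tree, p, q) if len(tree[v]) >= 3]

    best = max((branch_points(p, q) for p, q in combinations(ends, 2)), key=len)
    n = len(best)
    if n == 0:
        raise ValueError("the tree has no branch point")
    if n % 2 == 1:
        return (best[n // 2],)
    return (best[n // 2 - 1], best[n // 2])
```

On a tree with a central vertex joined to three branch points, each carrying two end vertices, a maximal pair has three branch points between its members and the function returns the central vertex.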

Number the branch points on $S_1$ in the following way: Let the first
branch point out from c be $x^1$; then on each minor branch from $x^1$ let the
first branch points be $x_1^2, x_2^2, \dots, x_m^2$. The branch point on the branch
containing $p_1$ we will call $x^2$. Let the first branch points on the minor
branches from $x^2$ be numbered similarly with the superscript 3. The branch
point on the branch containing $p_1$ we will call $x^3$. Continue in this manner
until all the branch points have been numbered. Number the branch points on
$S_2$ in the same manner except with the use of y instead of x.

The number of branch points from c to $p_1$ is the same as the number from
c to $p_2$; let us assume this number is k.

We must now show that there exists a homeomorphism T such that
$T(S_1) = S_2$. Since H is cyclic element homogeneous, let T be a homeomorphism
of H into itself such that $T(C_1^1) = C_1^2$ where $C_1^1 \supset p_1$ and
$C_1^2 \supset p_2$. Then $T(p_1) = p_2$, and $T(K_1) = K_2$. The arc $p_1 x^k$ is sent into
the arc $p_2 y^k$ and $x^k$ into $y^k$ by T. At the branch points $x^k$ and $y^k$ there are
the same number of branches because $x^k$ and $y^k$ correspond to each other under
a homeomorphism. There is no branch point on any of the minor branches
at $x^k$ or $y^k$, for if there were one on some minor branch, say at $y^k$, there would
be more branch points on the arc joining one of the centers $p_i$ (on this branch)
and $p_1$ than on the arc $p_1 p_2$. Then $p_1$, $p_2$ would not be a maximal pair.




The arc $x^k x^{k-1}$ is sent into the arc $y^k y^{k-1}$ by T so that $x^{k-1}$ is sent into
$y^{k-1}$. There are the same number of branches at $x^{k-1}$ as at $y^{k-1}$ because $x^{k-1}$
and $y^{k-1}$ correspond to each other under a homeomorphism. There cannot
be more than one branch point on any of the minor branches at $x^{k-1}$ or $y^{k-1}$;
for if there were, $p_1$, $p_2$ would not have been a maximal pair. There must be
one branch point on each of the minor branches at $x^{k-1}$ and $y^{k-1}$ and the number
of minor branches at these branch points is the same as the number of minor
branches at $x^k$ or $y^k$.

Now continue down the major branch at $x^{k-1}$ and $y^{k-1}$. The arc $x^{k-1} x^{k-2}$
is sent into the arc $y^{k-1} y^{k-2}$ by T, and then $x^{k-2}$ is sent into $y^{k-2}$. The number
of minor branches at $x^{k-2}$ and $y^{k-2}$ is the same, because $x^{k-2}$ and $y^{k-2}$ correspond
to each other under a homeomorphism. The minor branches at $x^{k-2}$ are homeomorphic
with the minor branches at $y^{k-2}$. On continuing in this manner until
we reach $T(x^1) = y^1$, it can be shown that the minor branches at $x^1$ are
homeomorphic with the minor branches at $y^1$. Then the arc $x^1 c - c$ is sent
into the arc $y^1 c - c$ by T. Therefore $T(S_1) = S_2$.

If there are only two components of H - c, the theorem is proved. In
the case where there are more than two components of H - c, take any component
$S_3$ different from $S_1$ and $S_2$. Let $p_3$ be the center of the cluster $K_3$
such that the number of branch points on the arc joining it to c is a maximum
for all centers $p_i$ in $S_3$. Let the number of branch points on this arc be m.
Then $m \le k$, for if it were greater $p_1$, $p_2$ would not have been a maximal pair.
Number the branch points on $S_3$ the same as on $S_1$, denoting them by z
instead of x.

Since H is cyclic element homogeneous, let T(H) = H in such a way that
$T(C_1^1) = C_1^3$ where $C_1^1 \subset K_1 \subset S_1$ and $C_1^3 \subset K_3 \subset S_3$. Then $T(K_1) = K_3$,
$T(p_1) = p_3$, $T(p_1 x^k) = p_3 z^m$ and $T(x^k) = z^m$. The number of branches at $x^k$
is the same as the number at $z^m$, for $x^k$ and $z^m$ correspond to each other under
a homeomorphism. There is no branch point on any of the minor branches at
$x^k$ or $z^m$, for if there were, $p_3$ would not have been a maximal number of branch
points from c. Then $T(x^k x^{k-1}) = z^m z^{m-1}$ and $T(x^{k-1}) = z^{m-1}$. The number
of branches at $x^{k-1}$ and $z^{m-1}$ are the same since $x^{k-1}$ and $z^{m-1}$ correspond
under a homeomorphism. The minor branches at $z^{m-1}$ are homeomorphic and
homeomorphic to the minor branches at $x^{k-1}$.

It is thus seen that if m = k, $S_1$ is homeomorphic with $S_3$. If m < k,
say m + 1 = k, then one minor branch at $x^1$ is sent into $S_3$ and $x^1$ is sent
into c. Since $x^1$ and c correspond to each other under a homeomorphism,
there are the same number of branches at $x^1$ as there are at c. This may or
may not be true; if not, it is seen immediately that $T(H) \ne H$, and therefore
m = k. If the number of branches at $x^1$ and c are the same, consider the
number of branch points on the arc from $x^1$ to $p_2 \in S_2$. There are k + 1. The
branch from $x^1$ containing these must be sent into a component of H - c
different from $S_3$; but we have seen that the maximum number of branch
points from c to the center of any cluster in H - c was k. Therefore this is
impossible and $T(H) \ne H$. Thus m = k. Therefore all of the components
of H - c are homeomorphic.

It must now be shown that any true cyclic element in one component may
be sent into any true cyclic element in another component by a homeomorphism
which sends the first component into the second. Let $C_1^i$ be any true cyclic
element in $S_i$ and $C_1^j$ any in $S_j$. Now $C_1^i \subset K_i$ and $C_1^j \subset K_j$. Since
H is cyclic element homogeneous, let T(H) = H in such a manner that
$T(C_1^i) = C_1^j$. It has already been shown that $S_i$ and $S_j$ are homeomorphic,
that every cluster on $S_i$ and $S_j$ is the same number (k) of branch points away
from c, and that at each i-th branch point out from c (i = 1, 2, 3, ..., k) there
are the same number of minor branches which are homeomorphic with one
another. Let the branch points on $S_i$ and $S_j$ be numbered in the same manner
as were the branch points on $S_1$ and denoted by u and v, respectively. We
have then $T(K_i) = K_j$, $T(p_i) = p_j$, $T(p_i u^k) = p_j v^k$ and $T(u^k) = v^k$. The
remaining minor branches at $u^k$ are sent into the remaining minor branches
at $v^k$ by T. Then $T(u^k u^{k-1}) = v^k v^{k-1}$ and $T(u^{k-1}) = v^{k-1}$. The remaining
minor branches at $u^{k-1}$ are sent into the remaining minor branches at $v^{k-1}$
by T. Continuing this finally yields $T(u^1) = v^1$. The remaining minor
branches at $u^1$ are sent into the remaining minor branches at $v^1$. Then
$T(u^1 c - c) = v^1 c - c$, thus $T(S_i) = S_j$. Q.E.D.


2.6. Summary. We have proved that if H is cyclic element homogeneous
and contains only a finite number of true cyclic elements $C_i$ then
1) $C_i \cdot \overline{H - C_i}$ = a single point, for each i.
2) Each cluster $K_i$ has the same number of true cyclic elements.
3) $H - K_i$, for every i, is connected.
4) $H - \sum_i K_i$ is an acyclic curve with a finite number of branches.
5) There exists a point c such that
a. Every cluster is the same number of branch points distant from c.
b. At each i-th branch point from c (i = 1, 2, ..., k) there are the
same number of branches, and all the minor branches are homeomorphic.
c. On every component of H - c there are the same number of branch
points and the same number of clusters.
d. Every component of H - c is homeomorphic with every other and
any true cyclic element of one component may be sent into any
true cyclic element of any other component by a homeomorphism
T which sends the first component into the second and which
sends H into itself.
6) All the true cyclic elements are homeomorphic, and if $p_i^r \in C_r$ and
$p_i^s \in C_s$ are such that $C_r \cdot \overline{H - C_r} = p_i^r$ and $C_s \cdot \overline{H - C_s} = p_i^s$, then
the homeomorphism $W(C_r) = C_s$ is such that $W(p_i^r) = p_i^s$ for
some i.


Fig. 4 is an example of a set L which is cyclic element homogeneous. The 
point c is the center of symmetry, and there are three components of L —c. 


[Fig. 4]




2.7. THEOREM. If H contains only a finite number of true cyclic elements,
in order that H be cyclic element homogeneous it is necessary and
sufficient that (1) the true cyclic elements be grouped in clusters with the
same number in each cluster, (2) no cluster cuts H, (3) there exists a point c
of H such that each cluster is the same number, k, of branch points away from c,
(4) at each i-th branch point (i = 1, 2, ..., k) from c there are the same
number of branches, and (5) if $C_r$ and $C_s$ are any two true cyclic elements
of H, there exists a homeomorphism of $C_r$ into $C_s$ which sends the cut points
of H on $C_r$ into the cut points of H on $C_s$.


Proof. The necessity follows from 2.5. To show the sufficiency take any
two components $S_1$, $S_2$ of H - c. Denote the branch points on $S_1$ by x and
those on $S_2$ by y, as was done in 2.5. Go out along the arcs from c to $x^1$ and
to $y^1$ lying in $S_1$ and $S_2$ respectively. There are the same number of branches
at $x^1$ as at $y^1$, by hypothesis. Out from $x^1$ and $y^1$ on each minor branch there
is a second branch point from c. There are the same number of branches at
each of these by hypothesis. Out from the second branch points from c on
each of the minor branches is the third branch point from c, and there are
the same number of branches at each of these.

Continue this until we get to the k-th branch points from c. By hypothesis
there are the same number of branches at each of these points. Out from the
k-th branch points on each of the minor branches is a cluster of true cyclic
elements, for otherwise this branch would not have been in H. There cannot
be any branch point on any of these branches, for if there were there would
be at least two clusters which had more than k branch points on the arc from
the center of the cluster to c. There is no cluster on any minor branch from
$x^1$ and $y^1$ before the k-th branch point, for if there were there would be fewer
than k branch points on the arc from its center to c. Thus there is a cluster
of true cyclic elements at the end of each minor branch from the k-th branch
points. By hypothesis there is the same number of true cyclic elements in each
cluster. Since no cluster cuts H, every two of the clusters are joined by an
arc in H which passes through at least one branch point. Each true cyclic
element contains only one point which cuts H, namely the center of the cluster
in which the true cyclic element is contained. This follows from the fact that
no cluster cuts H.

It must now be shown that there exists a homeomorphism T such that
if there be given any two true cyclic elements $C_i$, $C_j$ of H, then T(H) = H
in such a way that $T(C_i) = C_j$. Take any two true cyclic elements $C_1$ and $C_2$.
By hypothesis there exists a homeomorphism W such that $W(C_1) = C_2$ and
$W(p_1) = p_2$ where $p_1 \in C_1 \cdot \overline{H - C_1}$ and $p_2 \in C_2 \cdot \overline{H - C_2}$. Define the
homeomorphism T = W over $C_1$. There are the same number of true cyclic elements
in each cluster, hence the definition of T can be extended so that $T(K_1) = K_2$
where $K_1 \supset C_1$ and $K_2 \supset C_2$. If $p_1 x^k$ and $p_2 y^k$ are the arcs from $K_1$ and $K_2$
to the branch points $x^k$ and $y^k$ respectively, we so extend the definition of T
that $T(p_1 x^k) = p_2 y^k$, and $T(x^k) = y^k$; also $T(b_{k,i}) = b^*_{k,i}$, where $b_{k,i}$ are
the minor branches at $x^k$ which do not contain $p_1$ and $b^*_{k,i}$ the minor branches
at $y^k$ which do not contain $p_2$, i = 2, 3, ..., m. If $x^k x^{k-1}$ and $y^k y^{k-1}$ are the
arcs from $x^k$ and $y^k$ to the (k - 1)-st branch points, we define
$T(x^k x^{k-1}) = y^k y^{k-1}$ and $T(x^{k-1}) = y^{k-1}$; also $T(b_{k-1,i}) = b^*_{k-1,i}$ where
$b_{k-1,i}$ are the minor branches at $x^{k-1}$ not containing $x^k$ and $b^*_{k-1,i}$ the minor
branches at $y^{k-1}$ not containing $y^k$, i = 2, 3, ..., r. Continue in this manner
until $x^j = y^j$ or until c is reached on both components. If $b^1$ and $b^2$ are the
branches from $x^j$ or c, as the case may be, which contain $C_1$ and $C_2$ respectively,
for each point $x \in b^2$ define $T(x) = T^{-1}(x)$; for each point $x \in H - (b^1 + b^2)$
define T(x) = x. We thus have a homeomorphism T which sends $C_1$ into $C_2$, the
component of H - c which contains $C_1$ into the component of H - c which
contains $C_2$, and H into itself.
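Conditions (3) and (4) of 2.7 can be checked mechanically on a combinatorial model. The following sketch is an illustration only, under the assumption that each component of H - c has been reduced to a rooted tree in which every internal vertex is a branch point and every end vertex carries a cluster; the function name `level_profile` and the dictionary `children` are hypothetical.

```python
def level_profile(children, root):
    """Return the list of child counts, level by level, if every vertex at
    the same distance from the root has the same number of children (so
    that all end vertices lie at the same depth); otherwise return None.

    `children` maps a vertex to the list of its children; end vertices may
    be omitted or mapped to an empty list.  At a vertex other than the
    root, the number of branches is its number of children plus one.
    """
    level = [root]
    profile = []
    while level:
        counts = {len(children.get(v, [])) for v in level}
        if len(counts) != 1:
            return None  # two vertices of one level disagree
        (count,) = counts
        if count == 0:
            break  # every vertex of this level is an end vertex
        profile.append(count)
        level = [w for v in level for w in children[v]]
    return profile
```

A returned list of length k + 1 (root included) corresponds to every cluster being k branch points away from the root, in the sense of this simplified model.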


2.8. Definition. A set of points H is said to be bi-cyclic element homogeneous
if, given any two true cyclic elements of H, there exists a homeomorphism
T such that T(H) = H and the true cyclic elements are sent into
each other.

From the way the homeomorphism T was defined in 2.7, it is seen that
if a set H satisfies the conditions of 2.7 it is bi-cyclic element homogeneous.

As was stated in the introduction, Fig. 3 is an example of a space M
which is cyclic element homogeneous but is not bi-cyclic element homogeneous.
The true cyclic element $C_1$ may be sent into any other true cyclic
element of the space by a homeomorphism which sends M into itself; but $C_2$
cannot be sent into $C_1$ by a homeomorphism which sends $C_1$ into $C_2$ and M
into itself.

It may be remarked that the finite case just treated could have been 
reduced to the consideration of an acyclic curve which was end point homo- 
geneous. However, little if any advantage in simplicity seems to accrue from 
such a reduction. 


3. The case of infinitely many true cyclic elements. Although no com- 
plete solution for this case of the problem has yet been obtained, we shall state 
here some results bearing on certain important phases of it. 


3.1. If H is cyclic element homogeneous and contains infinitely many 
true cyclic elements which are grouped in clusters, where no cluster cuts H, 


then there are a finite number of clusters with an infinite number of true 
cyclic elements in each cluster. Also under this hypothesis there exists a 
point c such that each cluster is the same number of branch points away from c
and at each i-th branch point (i = 1, 2, ..., k) from c there are the same
number of branches. It can be shown that the necessary and sufficient
condition for this case is similar to that of the finite case.


3.2. Let H satisfy the conditions: (1) it contains infinitely many true
cyclic elements $\{C_i\}$, (2) each cut point is contained in exactly two true
cyclic elements, (3) each component of $H - C_i$ has a different boundary point,
(4) the set of cut points of H is totally disconnected, and (5) no point p is
the limit of true cyclic elements in more than one component of H - p.

Under these restrictions, a necessary condition that H be cyclic element
homogeneous is that when $C_s$ and $C_t$ are any two true cyclic elements of H,
there exists a homeomorphism of $C_s$ into $C_t$ such that one particular cut point
in $C_s$ is sent into one particular cut point in $C_t$ and the remaining cut points
in $C_s$ are sent into the remaining cut points in $C_t$.

Thus far it has not been shown that this condition is sufficient for H
to be cyclic element homogeneous. But a sufficient condition is that when
$C_s$ and $C_t$ are any two true cyclic elements of H, there exists a homeomorphism
T of $C_s$ into $C_t$ such that two particular cut points in $C_s$ are sent into two
particular cut points in $C_t$ and the remaining cut points in $C_s$ are sent into
the remaining cut points in $C_t$.

It is to be noted that the sufficient condition is stronger than the necessary 
condition. 


THE JOHNS HOPKINS UNIVERSITY. 




A NAVIGATION PROBLEM IN THE CALCULUS OF VARIATIONS.* 


By E. J. McSHANE. 


The problem which I shall consider in this note is closely related to the
Zermelo navigation problem.¹ Let us suppose that the velocities relative to
the air which can be attained by an airship consist of all the vectors r lying in
a convex body K(x, t), depending on the position $x = (x^1, x^2, x^3)$ and the
time t. The air is supposed to be in motion, its velocity being a continuous
vector function u(x, t). Given two points $x_1$, $x_2$, the problem is to find a path
from $x_1$ to $x_2$ which can be traversed by the ship in the least possible time.
If K(x, t) is the sphere² $|r| \le k$, where k is a constant, and if we add the
further requirement that the speed relative to the air shall almost always be
exactly k, this becomes the Zermelo navigation problem. Our replacement
of the sphere $|r| \le k$ by the convex body K is suggested by the fact that an
airplane can travel faster down than up.

In the present paper I first prove under weak hypotheses the existence of 
a solution of the problem proposed above. I then consider the problem modi- 
fied so as to be a generalization of the Zermelo problem (i. e. the ship’s velocity 
is required to be almost always as great as possible), and under stronger
hypotheses I prove that this problem also is solvable.


1. Throughout the following pages we shall use the following definitions 
and assumptions: 

A is a bounded closed point set (atmosphere) in $(x^1, x^2, x^3)$-space.

$\mathcal{A}$ is a bounded closed set of real numbers a.

K(x, t) is a bounded closed convex point set (or set of vectors) in three-dimensional
space, defined and continuous³ for $x \in A$ and $-\infty < t < \infty$.

u(x, t) is a vector function $(u^1(x, t), u^2(x, t), u^3(x, t))$ defined and
continuous for $x \in A$ and all t.


* Received December 3, 1936. 

¹ For a detailed study of this problem, as well as a bibliography of previous papers,
the reader is referred to a memoir by B. Mania, shortly to appear in Mathematische
Annalen.

² $|r|$ is the length $[\sum (r^i)^2]^{1/2}$ of the vector r.

³ For any convex set K, let $K_\epsilon$ be the set of all points having distance $\le \epsilon$ from K.
A convex set $K(\theta)$ is a continuous function of $\theta$ at $\theta_0$ if for each $\epsilon > 0$ there is a
neighborhood U of $\theta_0$ such that for every $\theta$ in U the inclusions
$K(\theta) \subset K_\epsilon(\theta_0)$ and $K(\theta_0) \subset K_\epsilon(\theta)$ hold.




V(x, t) is the set of all vectors of the form u(x, t) + r with $r \in K(x, t)$.
That is, V is the translation of K by the vector u, and so it satisfies all the
conditions imposed above on K.

Let the path of the ship be given in the form $C: x = x(t)$, $a \le t \le a + T$,
where the parameter t is the time. The mean velocity of the ship between
times $t_1$ and $t_2$ is $(x(t_2) - x(t_1))/(t_2 - t_1)$, and we shall assume that this
is bounded. The velocity at time t is $x'(t)$, if $x'(t)$ exists. The velocity
relative to the air is then $x'(t) - u(x(t), t)$, and this must lie in the class
K(x(t), t) of velocities attainable at place x(t) and time t. That is, by the
definition of V(x, t) we must have $x'(t) \in V(x(t), t)$. Combining this with
the previous requirements on the paths x(t) to be considered, we are led to the
definition:

The curve $C: x = x(t)$, $a \le t \le a + T$ is admissible (or, more fully,
an admissible curve traversable in the interval [a, a + T]) if

(1.1a) the functions x(t) are Lipschitzian;

(1.1b) $x(t) \in A$ for $a \le t \le a + T$;

(1.1c) $a \in \mathcal{A}$;

(1.1d) $x(a) = x_1$, $x(a + T) = x_2$;

(1.1e) $x'(t) \in V(x(t), t)$ wherever $x'(t)$ is defined.


It is convenient for the proof to have also the notion of a weakly admissible
curve. A curve C is weakly admissible if it satisfies (1.1a, b, c, d) and satisfies
(1.1e) if we replace the words “wherever $x'(t)$ is defined” by “almost
everywhere.”
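For a polygonal path, condition (1.1e) can be tested segment by segment. The sketch below is illustrative only and rests on several assumptions not made in the paper: K is taken to be the ball of radius k (the Zermelo case), the wind is sampled only at the midpoint of each segment, and the names `relative_speed_ok`, `wind` and `tol` are hypothetical. Passing this check is therefore a sampled necessary condition, not a proof of admissibility.

```python
import math


def relative_speed_ok(times, points, wind, k, tol=1e-12):
    """Check |x'(t) - u(x(t), t)| <= k on each segment of a polygonal path.

    times  -- increasing list of instants t_0 < t_1 < ... < t_N
    points -- positions x(t_i), each a sequence of coordinates
    wind   -- function (position, time) -> wind velocity u
    The wind is evaluated only at segment midpoints (a sampling assumption).
    """
    legs = zip(zip(times, points), zip(times[1:], points[1:]))
    for (t0, p0), (t1, p1) in legs:
        dt = t1 - t0
        velocity = [(b - a) / dt for a, b in zip(p0, p1)]
        t_mid = 0.5 * (t0 + t1)
        p_mid = [0.5 * (a + b) for a, b in zip(p0, p1)]
        u = wind(p_mid, t_mid)
        relative = math.sqrt(sum((v - w) ** 2 for v, w in zip(velocity, u)))
        if relative > k + tol:
            return False
    return True
```

With a constant tail wind of unit speed and k = 1, a ship can cover two units of distance downwind in unit time, but cannot make any headway of unit speed upwind.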

We obtain an obviously equivalent definition by changing parameters from
t to $\tau = (t - a)/T$; a curve $C: x = x(\tau)$, $0 \le \tau \le 1$, is an admissible curve
traversable in the interval [a, a + T] if

(1.2a) the functions $x(\tau)$ are Lipschitzian;

(1.2b) $x(\tau) \in A$ for $0 \le \tau \le 1$;

(1.2c) $a \in \mathcal{A}$;

(1.2d) $x(0) = x_1$, $x(1) = x_2$;

(1.2e) $x'(\tau)/T \in V(x(\tau), a + T\tau)$ wherever $x'(\tau)$ is defined.
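The equivalence of (1.1e) and (1.2e) is an instance of the chain rule. The short derivation below is a sketch for the reader's convenience; the tilde is introduced here only to distinguish the two parametrizations (the paper uses the same letter x for both), and T > 0 is assumed.

```latex
\[
  \tilde x(\tau) = x(a + T\tau), \qquad
  \tilde x'(\tau) = T\,x'(a + T\tau) \quad (0 \le \tau \le 1),
\]
\[
  x'(t) \in V\bigl(x(t),\, t\bigr)
  \iff
  \tilde x'(\tau)/T \in V\bigl(\tilde x(\tau),\, a + T\tau\bigr),
  \qquad t = a + T\tau .
\]
```

Since the map from $\tau$ to t is linear with positive slope T, it carries sets of measure zero into sets of measure zero, so the same computation relates the two definitions of weak admissibility.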


Likewise, if in (1.2) we replace the words “wherever $x'(\tau)$ is defined” by
“for almost all $\tau$” we obtain a definition of a weakly admissible curve.

If a function F(t) is defined and summable over a set E - N, where E
is measurable and N has measure 0, we shall define

$\int_E F(t)\,dt = \int_{E-N} F(t)\,dt.$

This allows us to write, for example,

$\int_a^b x'(t)\,dt = x(b) - x(a)$

if x(t) is absolutely continuous on [a, b].
2. We now can state: 


THEOREM I. Under the hypotheses of §1, if there exists an admissible
curve, then there is an admissible curve C traversable in a time interval
[a, a + T] for which the time of traversal T is the least possible.


Let $T_0$ be the greatest lower bound of the times of traversal of all admissible
curves and let W be the greatest lower bound of the times of traversal of all
weakly admissible curves. Since the latter class contains the former, $W \le T_0$.

We now choose a sequence of weakly admissible curves $C_n: x = x_n(\tau)$,
$0 \le \tau \le 1$, traversable in the respective intervals $[a_n, a_n + T_n]$, for which
$T_n \to W$. From these we can select a subsequence such that $a_n$ tends to a
definite limit $a_0$; we suppose that $\{C_n\}$ is already such a sequence. Since $\mathcal{A}$ is
closed, $a_0 \in \mathcal{A}$. All the time intervals $[a_n, a_n + T_n]$ lie in a bounded closed
time interval, and all x lie in the bounded closed set A, and V(x, t) is continuous;
so for all such (x, t) the body V(x, t) lies in a sphere about the
origin of finite radius M. Then $|x_n'(\tau)|/T_n \le M$, so $|x_n'(\tau)|$ is uniformly
bounded. Hence by Hilbert’s theorem we can select a subsequence of the
$x_n(\tau)$ which converges uniformly to a Lipschitzian limit function $x_0(\tau)$. We
wish to prove that $x_0(\tau)$ is the curve sought. The proof will be given in a
lemma.


LEMMA 2.1. If the curves $C_n: x = x_n(\tau)$, $0 \le \tau \le 1$ are weakly admissible
curves traversable in the intervals $[a_n, a_n + T_n]$, and

(2.3) $a_n \to a_0$, $T_n \to U$, $x_n(\tau) \to x_0(\tau)$ uniformly in $\tau$,

then $C_0: x = x_0(\tau)$ is an admissible curve traversable in the time interval
$(a_0, a_0 + U)$.


Condition (1.2a) clearly holds, for $x_0(\tau)$ has bounded derivatives as we
saw just above. (1.2b) holds by the closure of A, and (1.2c) by the closure
of $\mathcal{A}$. (1.2d) is evident, for $x_1 = x_n(0) \to x_0(0)$ and $x_2 = x_n(1) \to x_0(1)$.
We now turn to the proof of (1.2e).

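Suppose that $\tau_0$ is any number in [0, 1] such that $x_0'(\tau_0)$ exists. If $\epsilon$ is a
positive number, we define $V_\epsilon$ to be the set of all points whose distance from
$V(x_0(\tau_0), a_0 + U\tau_0)$ is $\le \epsilon$. By (2.3) and the continuity of $x_0(\tau)$ and
V(x, t), we find that

(2.4) there exists a $\delta > 0$ and an integer $n_0$ such that if $|\tau - \tau_0| < \delta$
and $n > n_0$, then $V(x_n(\tau), a_n + T_n\tau) \subset V_\epsilon$.

Therefore, for all $n > n_0$ and almost all $\tau$ such that $|\tau - \tau_0| < \delta$,

(2.5) $x_n'(\tau)/T_n \in V_\epsilon$.

Now suppose $0 < |h| < \delta$. Since $V_\epsilon$ is closed and convex, by Jensen’s
inequality⁴

$[x_n(\tau_0 + h) - x_n(\tau_0)]/(h T_n) = (1/h) \int_{\tau_0}^{\tau_0 + h} [x_n'(\tau)/T_n]\,d\tau \in V_\epsilon$.

Let h be fixed and let $n \to \infty$. By the closure of $V_\epsilon$,

$[x_0(\tau_0 + h) - x_0(\tau_0)]/(hU) = \lim_{n \to \infty} [x_n(\tau_0 + h) - x_n(\tau_0)]/(h T_n) \in V_\epsilon$.

Now let $h \to 0$. Again by the closure of $V_\epsilon$, $x_0'(\tau_0)/U \in V_\epsilon$. But here the point
$x_0'(\tau_0)/U$ does not depend on $\epsilon$, and it can only belong to $V_\epsilon$ for every $\epsilon > 0$
if it belongs to $V(x_0(\tau_0), a_0 + U\tau_0)$. Hence (1.2e) is satisfied and
Lemma 2.1 is established.⁵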
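The step that uses Jensen's inequality has a simple discrete analogue: for a polygonal path with equal time steps, the difference quotient over the whole path is the arithmetic mean of the segment velocities, and a mean of points of a convex set lies in that set. The sketch below illustrates this with a ball standing in for the convex set $V_\epsilon$; the ball, the equal steps and all function names are assumptions made for the illustration.

```python
import math


def segment_velocities(points, dt):
    """Velocities on the segments of a polygonal path with time step dt."""
    return [[(b - a) / dt for a, b in zip(p, q)]
            for p, q in zip(points, points[1:])]


def mean_vector(vectors):
    """Arithmetic mean of a non-empty list of vectors of equal dimension."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]


def difference_quotient(points, dt):
    """(x(end) - x(start)) / (total elapsed time)."""
    total = dt * (len(points) - 1)
    return [(b - a) / total for a, b in zip(points[0], points[-1])]


def in_ball(v, center, radius, tol=1e-12):
    """Membership in the closed ball, used here as a sample convex set."""
    return math.dist(v, center) <= radius + tol
```

In the lemma itself the mean is an integral rather than a finite sum, which is why Jensen's inequality for closed convex sets is cited there.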

Returning to the proof of Theorem I, by Lemma 2.1, the curve $C_0$ is an
admissible curve traversable in time $W \le T_0$. But the time of traversal of any
admissible curve is $\ge T_0$. Hence the time of traversal of $C_0$ is $W = T_0$.
This completes the proof of Theorem I.


Remark. We could alter the problem by assuming that u(x, t) and
K(x, t) are defined only for t in a closed interval $t_0 \le t \le t_1$; nothing in the
preceding demonstration would be altered.

From Lemma 2.1 we draw another conclusion: 


LEMMA 2.2. Every weakly admissible curve is admissible.


For let $C: x = x(\tau)$, $0 \le \tau \le 1$ be a weakly admissible curve traversable
in the interval (a, a + T). In Lemma 2.1 we take $x_n(\tau) = x(\tau)$, $a_n = a$,
$T_n = T = U$ for all n. Then the hypotheses of the lemma are satisfied, and
the conclusion informs us that the limit curve (which is C itself) is an admissible
curve traversable in the interval [a, a + T].


⁴ Cf., for example, E. J. McShane, “On Jensen’s inequality,” Bulletin of the American
Mathematical Society, vol. 43 (1937).

⁵ We have proved a little more than we stated. If $\tau_0$ is arbitrary, the above proof
shows that every vector which is the limit of a sequence $[x_0(\tau_0 + h_n) - x_0(\tau_0)]/(U h_n)$ as
$h_n \to 0$ must belong to $V(x_0(\tau_0), a_0 + U\tau_0)$, even though $x_0'(\tau_0)$ may not exist.




3. For the next theorem we shall add the hypothesis that the ship’s
engines are powerful enough so that at any time and place it can proceed with
speed $\ge \delta$ ($\delta$ a positive number) in any desired direction. Analytically, this
means that the set of velocities which the ship can attain, namely

V(x, t) = u(x, t) + K(x, t),

shall contain all velocities v for which $|v| \le \delta$; that is, the sphere $|v| \le \delta$
is contained in V(x, t) for all x in A and all t. If this is the case, for each
direction (unit vector) d and each (x, t) there is just one number $\rho = \rho(d, x, t)$
such that $\rho d$ is on the boundary of V(x, t); and $\rho \ge \delta$ for all d, all x in A and
all t. It is quite easy to see that if $\rho(d, x, t)$ is continuous, then the body
V(x, t) is a continuous function of (x, t). It is somewhat less easy to see
that the converse is true;⁶ so in order to save some space we shall hereafter
replace the assumption that V(x, t) is continuous by the assumption (only
apparently stronger) that $\rho(d, x, t)$ is continuous. Then


THEOREM II. Let the boundary of the body V(x, t) be given in polar
coordinates by the equation $\rho = \rho(d, x, t)$, where $\rho(d, x, t) \ge \delta > 0$ and $\rho$ is a
continuous function of all its arguments. Then, if there exists an admissible
curve, there is an admissible curve $C: x = X(t)$, $a \le t \le a + T$ such that the
time T of traversal is the least possible, and such moreover that

$0 < |X'(t)| = \rho(X'(t)/|X'(t)|, X(t), t)$

for almost all t.
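As a concrete instance of the polar function $\rho$, consider the special case (an assumption made here, not in the theorem) in which K is the ball $|r| \le k$ and the wind u is slower than the ship, $|u| < k$. Then $\rho$ is the positive root of $|\rho d - u| = k$, and its least value over all directions is $k - |u|$, which can serve as $\delta$. The function name `rho_ball` is hypothetical.

```python
import math


def rho_ball(d, u, k):
    """Distance from the origin to the boundary of the ball |v - u| <= k
    in the unit direction d, assuming |u| < k.

    Solves rho**2 - 2*rho*(d.u) + |u|**2 - k**2 = 0 for the positive root.
    """
    du = sum(a * b for a, b in zip(d, u))
    uu = sum(a * a for a in u)
    if uu >= k * k:
        raise ValueError("the wind must be slower than the ship: |u| < k")
    return du + math.sqrt(du * du + k * k - uu)
```

For u = (0.5, 0, 0) and k = 1 the ship makes 1.5 downwind and 0.5 upwind, so $\delta = 0.5$ in this example.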


By Theorem I, there is a curve $C: x = x(t)$, $a \le t \le a + T$ which can
be traversed in time T which is the least possible time of traversal of any
admissible curve. Let $x = \xi(s)$, $0 \le s \le L$ be the representation of C with
arc length as parameter. To each t in (a, a + T) there corresponds a value
$s_0(t)$ of the parameter s, and $s_0'(t) = |x'(t)|$ for almost all t. This function
is monotonic increasing and absolutely continuous. It may not have a single
valued inverse, but we define $t_0(s)$ to be the least t such that $s_0(t) = s$. Then
$t_0(s)$ is defined and single valued (possibly discontinuous) for $0 \le s \le L$, and
$s_0(t_0(s)) = s$. Also
(3.1) $t_0(0) = a$, $t_0(L) = a + T$.
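The possible discontinuity of $t_0(s)$ is easy to see on a polygonal path that rests for a while. The sketch below is an illustration under the assumption that the path is given by its vertices at increasing times and is traversed at constant speed on each segment; `arc_length_table` and `least_time` are hypothetical names.

```python
import bisect
import math


def arc_length_table(points):
    """Cumulative arc length s_0 at the vertices of a polygonal path."""
    table = [0.0]
    for p, q in zip(points, points[1:]):
        table.append(table[-1] + math.dist(p, q))
    return table


def least_time(times, s_table, s):
    """Return the least t with s_0(t) = s, where s_0 is linear in t
    between consecutive vertices."""
    i = bisect.bisect_left(s_table, s)  # first vertex with s_0 >= s
    if i == 0:
        return times[0]
    if i == len(s_table):
        raise ValueError("s exceeds the length of the curve")
    s_lo, s_hi = s_table[i - 1], s_table[i]  # here s_lo < s <= s_hi
    fraction = (s - s_lo) / (s_hi - s_lo)
    return times[i - 1] + fraction * (times[i] - times[i - 1])
```

If the ship stands still between t = 1 and t = 2 after covering unit length, then the least time for s = 1 is 1, while every s slightly greater than 1 has least time greater than 2, so the inverse jumps at s = 1.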


Since $x'(t) \in V(x(t), t)$, we see that $|x'(t)|$ is bounded, say $\le M$. Then for
any times $t_1$, $t_2 > t_1$ we find

$s_0(t_2) - s_0(t_1) = \int_{t_1}^{t_2} |x'(t)|\,dt \le M(t_2 - t_1)$,


⁶ This follows, for example, from pages 14 and 37 of Bonnesen and Fenchel, Theorie
der konvexen Körper.




whence for all values $s_1$, $s_2 > s_1$ of s

(3.2) $t_0(s_2) - t_0(s_1) \ge (s_2 - s_1)/M$.


The derivative $t_0'(s)$ exists and is finite for almost all s. Inequality (3.2)
shows $t_0'(s) \ge M^{-1}$, so

(3.3) for almost all s the derivatives $\xi'(s)$, $t_0'(s)$, and $s_0'(t_0(s))$ exist
and are finite, and $|\xi'(s)| = 1$, and $s_0'(t_0(s))\,t_0'(s) = 1$.


From this and the identity $x(t) = \xi(s_0(t))$ it follows that for almost all s
the derivative $x'(t)$ exists at $t = t_0(s)$ and

(3.4) $x'(t) = \xi'(s)\,s_0'(t)$.


Let S be the set of all s (of measure L) on which (3.3) and (3.4) hold.
We recall that for almost all t (and, a fortiori, almost all s) $x'(t) \in V(x(t), t)$,
so almost everywhere in S

(3.5) $0 < s_0'(t_0(s)) = |x'(t_0(s))| \le \rho(x'(t_0(s))/|x'(t_0(s))|, x(t_0(s)), t_0(s))$
$= \rho(\xi'(s), \xi(s), t_0(s))$.

This implies that almost everywhere in S

(3.6) $t_0'(s) = [s_0'(t_0(s))]^{-1} \ge [\rho(\xi'(s), \xi(s), t_0(s))]^{-1}$.

The function $\rho(\xi'(s), \xi(s), t)$ is defined for almost all $s$, is measurable in $s$ for fixed $t$ (being a continuous function of the measurable functions $\xi'(s)$, $\xi(s)$) and is continuous in $t$ for fixed $s$. Moreover $\rho \ge \delta$. Hence for every $\varepsilon > 0$ the function $[\rho(\xi'(s), \xi(s), t) + \varepsilon]^{-1}$ is measurable in $s$ for fixed $t$, continuous in $t$ for fixed $s$, and bounded. Therefore the equation

(3.7) $t_\varepsilon(s) = a - \varepsilon + \int_0^s [\rho(\xi'(s), \xi(s), t_\varepsilon(s)) + \varepsilon]^{-1}\,ds$

has an absolutely continuous solution $t_\varepsilon(s)$ on $(0, L)$. We now prove


(3.8) If $0 \le \beta < \varepsilon$, then $t_\beta(s) > t_\varepsilon(s)$ $(0 \le s \le L)$.

The graphs of $t = t_\beta(s)$ and $t = t_\varepsilon(s)$ are continuous curves. The latter is obvious, for $t_\varepsilon(s)$ is continuous. So is the former if $\beta > 0$. If $\beta = 0$, the graph can still be considered as the continuous curve $s = s_0(t)$, $a \le t \le a + T$. It is clear that (3.8) holds at $s = 0$, for

$t_\beta(0) - t_\varepsilon(0) = (a - \beta) - (a - \varepsilon) > 0$.




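If (3.8) is not always true, the graph of $t = t_\varepsilon(s)$ lies somewhere above the graph of $t = t_\beta(s)$, and so they must have a first intersection point $(\sigma, \tau)$, $s_\varepsilon(\tau) = \sigma$. (Here $s_\varepsilon$ is the inverse of $t_\varepsilon(s)$.) Thus for $s < \sigma$

(3.9) $t_\beta(s) > t_\varepsilon(s)$.

By the uniform continuity of $\rho$, there is an $h > 0$ such that

(3.10) $\rho(\xi'(s), \xi(s), t_1) + \beta < \rho(\xi'(s), \xi(s), t_2) + \varepsilon$ if $|t_1 - t_2| < h$.

By the continuity of $t_\varepsilon(s)$, there is a $k > 0$ such that

(3.11) $0 \le t_\varepsilon(\sigma) - t_\varepsilon(s) < h$ if $\sigma - k \le s \le \sigma$.

So for $\sigma - k \le s < \sigma$ we have, by (3.9) and (3.11),

$\tau - h \le t_\varepsilon(s) < t_\beta(s) \le \tau$,

and therefore, by (3.10), for almost all $s$ in $(\sigma - k, \sigma)$

(3.12) $\rho(\xi'(s), \xi(s), t_\beta(s)) + \beta < \rho(\xi'(s), \xi(s), t_\varepsilon(s)) + \varepsilon$.

By (3.12) and (3.7) (and (3.6) if $\beta = 0$)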


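(3.13) $t_\beta'(s) > t_\varepsilon'(s)$

for almost all $s$ in $(\sigma - k, \sigma)$. Now $t_\beta$ is monotonic increasing and $t_\varepsilon$ is absolutely continuous, so,* by (3.13)

$t_\beta(\sigma) - t_\varepsilon(\sigma) \ge t_\beta(\sigma - k) + \int_{\sigma-k}^{\sigma} t_\beta'(s)\,ds - t_\varepsilon(\sigma - k) - \int_{\sigma-k}^{\sigma} t_\varepsilon'(s)\,ds > t_\beta(\sigma - k) - t_\varepsilon(\sigma - k) > 0$.

Since $t_\varepsilon$ is continuous and $t_\beta$ is monotonic increasing, we can find an interval containing $\sigma$ on which $t_\beta(s) - t_\varepsilon(s) > 0$. So the graph of $t_\beta$ lies above the graph of $t_\varepsilon$ on this interval, and $(\sigma, \tau)$ cannot be an intersection point. This contradiction establishes statement (3.8).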
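Statement (3.8) can be illustrated numerically. The following is only a sketch under stated assumptions: the speed bound `rho` is a hypothetical function of the time alone (not taken from the paper), and the integral equation (3.7) is replaced by an explicit Euler scheme, so the output approximates $t_\varepsilon(s)$ rather than computing it exactly.

```python
import math

def rho(t: float) -> float:
    # Hypothetical boundary function, depending on t only; rho >= 1 > 0.
    return 2.0 + math.sin(t)

def solve_t_eps(eps: float, a: float = 0.0, L: float = 1.0, steps: int = 100) -> list:
    """Euler approximation of t_eps(s) = a - eps + integral_0^s ds / (rho + eps)."""
    h = L / steps
    t = a - eps                    # initial value t_eps(0) = a - eps
    values = [t]
    for _ in range(steps):
        t += h / (rho(t) + eps)    # dt/ds = 1 / (rho + eps)
        values.append(t)
    return values

t_beta = solve_t_eps(0.05)         # plays the role of t_beta, with beta = 0.05
t_eps = solve_t_eps(0.10)          # plays the role of t_eps, with eps = 0.10
gap = min(b - e for b, e in zip(t_beta, t_eps))
```

With these choices `gap` stays positive on the whole grid, in agreement with $t_\beta(s) > t_\varepsilon(s)$ for $\beta < \varepsilon$.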
Now let $\varepsilon$ tend to 0 through a monotonic decreasing sequence of values. By (3.8) the successive functional values $t_\varepsilon(s)$ increase for each $s$, so there exists a limit function:

(3.14) $\lim_{\varepsilon \to 0} t_\varepsilon(s) = \tau(s)$, $0 \le s \le L$.


* Hobson, Theory of Functions of a Real Variable, vol. I, p. 590. 


Also by (3.8), $t_\varepsilon(s) \le t_0(s)$ for each $\varepsilon > 0$, so in the limit

(3.15) $\tau(s) \le t_0(s)$, $0 \le s \le L$.

In equation (3.7), the integrand on the right is uniformly bounded and tends almost everywhere to $[\rho(\xi'(s), \xi(s), \tau(s))]^{-1}$, while on the left $t_\varepsilon \to \tau$; so

(3.16) $\tau(s) = a + \int_0^s [\rho(\xi'(s), \xi(s), \tau(s))]^{-1}\,ds$.


The function $\tau(s)$ has almost everywhere a positive derivative, by (3.16), so it has an absolutely continuous inverse $s = s(\tau)$, $a \le \tau \le \tau(L)$. Defining $X(\tau) = \xi(s(\tau))$, we find that for almost all $\tau$

$X'(\tau) = \xi'(s)\,s'(\tau)$, $s'(\tau) > 0$,

so $X'(\tau)/|X'(\tau)| = \xi'(s)$ for almost all $\tau$. Moreover, for almost all $\tau$

$X'(\tau) = \xi'(s)[\tau'(s)]^{-1} = \xi'(s)\,\rho(\xi'(s), \xi(s), \tau(s)) = (X'(\tau)/|X'(\tau)|)\,\rho(X'(\tau)/|X'(\tau)|, X(\tau), \tau)$,

whence

$|X'(\tau)| = \rho(X'(\tau)/|X'(\tau)|, X(\tau), \tau)$

for almost all $\tau$. That is, condition (2.1e) holds with the words "wherever $X'(\tau)$ is defined" replaced by "for almost all $\tau$." The others of conditions (2.1) are easily verified, so $x = X(\tau)$ is weakly admissible, and by Lemma 2.2 it is admissible. All that remains to prove is that the time of traversal $\tau(L) - a$ has the least possible value $T$. Clearly $\tau(L) - a \ge T$. By (3.15) and (3.1), $\tau(L) \le t_0(L) = a + T$, completing the proof.


Remark. If we change the problem by assuming that $u(x,t)$ and $K(x,t)$ are defined only on a time interval $t_0 \le t \le t_1$, one point of the preceding demonstration needs change. In defining $t_\varepsilon$, times $t < a$ entered. If however we define $V(x,t) = V(x,t_0)$ for $t < t_0$, the proof can be carried out as above, and the final results will involve only times $t$ in $[t_0, t_1]$.


UNIVERSITY OF VIRGINIA. 




THE TOPOLOGICAL DISCRIMINANT GROUP OF A RIEMANN
SURFACE OF GENUS p.


By Oscar ZARISKI. 


1. Introduction. The symmetric n-th product $K^n$ of a complex $K$ carries a subset $D$ whose points represent n-tuples on $K$ with two or more coincident points. We call $D$ the discriminant variety of $K^n$ and we refer to the fundamental group of the residual space $K^n - D$ as the topological discriminant group (of degree $n$) of the given complex $K$. In Part I we determine this group, $G_{n,p}$, when $K$ is a Riemann surface $R$ of genus $p$. We were led to examine this group by the following considerations. The variety $R^n$ is the space of all n-tuples of points of an algebraic curve $f$ of genus $p$. As such, $R^n$ carries, for a sufficiently high value of $n$ $(n \ge p + 2)$, a system of linear $(n - p)$-spaces $S_{n-p}$, images of complete linear series $g_n^{n-p}$ existent on $f$. If $D_1$ denotes the intersection of a general $S_{n-p}$ of the system with the discriminant variety $D$ of $R^n$, the fundamental group of the residual space $S_{n-p} - D_1$ can be shown to be an invariant subgroup $H_{n,p}$ of $G_{n,p}$, and the quotient group $G_{n,p}/H_{n,p}$ is simply isomorphic to the homology group of $R$. If $G_{n,p}$ is known, $H_{n,p}$ can be determined on the basis of well-known principles laid down by Reidemeister.t. By a theorem which we have proved elsewhere (Zariski®) the fundamental group of $S_{n-p} - D_1$ coincides with the fundamental group of the residual space of a general plane section $C$ of $D_1$. It is not difficult to see that $C$ is the plane dual of a general plane curve of order $n$ and genus $p$, so that $C$ is of order $2n + 2p - 2$ with $3(n + 2p - 2)$ cusps and $2(n - 2)(n - 3) + 2p(2n + p - 7)$ nodes. The knowledge of the fundamental group of $C$, of interest in itself, makes it also possible to determine the fundamental group of any plane curve admitting $C$ as a limiting case. In this connection we may point out that the class of curves thus obtained is not negligible, since, at present, duality constructions and limiting processes are, with a few exceptions, the only means of arriving at effectively existent curves with nodes and cusps.
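The order and the numbers of cusps and nodes quoted for the dual curve $C$ can be cross-checked against the classical Plücker and genus formulas. The sketch below assumes that "general plane curve of order $n$ and genus $p$" means a curve whose only singularities are nodes; the function names are illustrative and do not come from the paper.

```python
def dual_curve_invariants(n: int, p: int) -> dict:
    """Order, cusps and nodes of the dual of a nodal plane curve of order n, genus p."""
    nodes_of_curve = (n - 1) * (n - 2) // 2 - p     # genus formula, nodes only
    order = n * (n - 1) - 2 * nodes_of_curve        # class of the curve = order of the dual
    cusps = 3 * n * (n - 2) - 6 * nodes_of_curve    # flexes of the curve = cusps of the dual
    # The dual curve has the same genus p, which fixes its number of nodes.
    nodes = (order - 1) * (order - 2) // 2 - cusps - p
    return {"order": order, "cusps": cusps, "nodes": nodes}

def quoted_counts(n: int, p: int) -> dict:
    """The closed formulas stated in the text."""
    return {
        "order": 2 * n + 2 * p - 2,
        "cusps": 3 * (n + 2 * p - 2),
        "nodes": 2 * (n - 2) * (n - 3) + 2 * p * (2 * n + p - 7),
    }

# Dual of a plane cubic (n = 3, p = 1): a sextic with 9 cusps and no nodes.
cubic_dual = dual_curve_invariants(3, 1)
```

For $n = 3$, $p = 1$ both functions give order 6, 9 cusps and 0 nodes, which is the sextic with 9 cusps mentioned in connection with Part III.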

In Parts II and III we carry out in detail the above outlined considerations in the case $p = 1$. The somewhat elaborate group-theoretic apparatus of Part II is inherent to the reduction of the infinite set of generators and generating relations of $H_{n,1}$ (an invariant subgroup of $G_{n,1}$ of infinite index) to a finite set of generators and generating relations. The existence of such a finite set is, a priori, implied by the algebro-geometric interpretation of $H_{n,1}$ given in Part III.

An interesting special case, examined in Part III, is given by the dual 
of a plane cubic—a sextic with 9 cusps. It is then found that the 9 generating 
relations at the cusps enjoy properties which are in striking analogy with the 
well-known alignment properties of the configuration of the 9 flexes of a plane 
cubic. 


2. Let $R$ be a Riemann surface of genus $p$ and let $R^n$ be the symmetric n-th product of $R$, i.e. the space (of $n$ complex dimensions) of all unordered n-tuples of points of $R$, topologized in an obvious manner. We have shown elsewhere (Zariski,® p. 1), that $R^n$ is a manifold. We denote by $D$ the subvariety of $R^n$ whose points correspond to n-tuples of points of $R$ in which two or more points coincide. This variety $D$, which can legitimately be designated as the discriminant variety of $R^n$, is of $n - 1$ complex dimensions. The purpose of this and of the next two sections is the determination of the fundamental group of the residual space $R^n - D$. We denote this group by $G_{n,p}$.

The group $G_{n,0}$ is known (see Zariski,* p. 612). As in the just quoted paper, we interpret also here the group $G_{n,p}$ as the group of motion classes of $n$ points of $R$. The motions considered are those which carry a fixed initial set of $n$ distinct points $P_1, P_2, \ldots, P_n$ of $R$ into its initial position (allowing for a permutation of the points $P_i$) and in the course of which the variable
set consists always of distinct points. Two motions belong to the same class 
if they can be deformed into each other through a continuous chain of motions 
of the same nature. 

We fix on $R$ a set of retrosections $a_1, a_2, \ldots, a_{2p}$ on a common point $P_1$, belonging to our initial n-tuple of points. We choose our retrosections in such a manner that when $R$ is cut open along them, the resulting 2-cell $E_2$ is bounded by the closed polygon


We assume the points $P_2, \ldots, P_n$ in the interior of $E_2$ and we join the points $P_1, \ldots, P_n$ by a set of simple oriented arcs $s_1, \ldots, s_{n-1}$ (see figure 1).

The indicated orientations of the retrosections $a_i$ and of the arcs $s_i$ are such that at $P_1$ the positive sense on each retrosection $a_i$ points from the right-hand edge of the arc $s_1$ toward its left-hand edge (see figure 2).

We denote by $g_i$ the motion in which $P_i$ is carried into $P_{i+1}$ along the left-hand edge of $s_i$ and $P_{i+1}$ is carried into $P_i$ along the right-hand edge of $s_i$, while the remaining points $P_j$ are fixed in their initial positions. In the case $p = 0$ the elements $g_1, g_2, \ldots, g_{n-1}$ are generators of $G_{n,0}$ (see Zariski,* p. 610). It follows that any motion in the course of which the points of the variable set do not cross the boundary of $E_2$ can be expressed as a product of the $g_i$'s. Let us consider the motion in which $P_1$ describes the oriented retrosection $a_i$, while the remaining points $P_k$ are fixed $(k > 1)$. We shall denote this motion by the same letter $a_i$. It is obvious that any crossing of the boundary of $E_2$ introduces factors $a_i^{\pm 1}$, hence the elements $g_1, \ldots, g_{n-1}$, $a_1, \ldots, a_{2p}$ are the generators of $G_{n,p}$.

Fig. 1.

Fig. 2.


3. The generating relations of $G_{n,p}$. The relations

(α) $g_i g_j = g_j g_i$, $|i - j| \ne 1$;
(β) $g_i g_{i+1} g_i = g_{i+1} g_i g_{i+1}$, $(i = 1, 2, \ldots, n - 2)$;

established in our quoted paper (Zariski,* p. 612) remain valid also in the
present case. The relation (6) of the quoted paper now has to be replaced 
by the following: 


since the left-hand member represents a motion which, as is easily seen, can be deformed into one in which the points $P_2, \ldots, P_n$ are fixed, while $P_1$ describes a closed path surrounding the set $\{P_2, \ldots, P_n\}$. This closed curve can be deformed into the boundary of the cell $E_2$. Other generating relations, involving the $g$'s and the elements $a_i$, are obtained as follows:

In the first place it is clear that each $a_i$ is permutable with each of the elements $g_2, \ldots, g_{n-1}$, since the corresponding paths do not intersect. Hence

(δ) $a_i g_k = g_k a_i$, $(k = 2, \ldots, n - 1)$.

The motion $g_1^{-1} a_i g_1^{-1}$ can be deformed into a motion in which the points $P_1, P_3, \ldots, P_n$ are fixed, while $P_2$ describes a retrosection homologous to $a_i$ and not meeting $a_i$ (see figure 2, the path of $P_2$ is indicated by the dotted curve). Hence $a_i$ and $g_1^{-1} a_i g_1^{-1}$ are permutable, whence the relation

(ε) $(g_1^{-1} a_i)^2 = (a_i g_1^{-1})^2$.

We now introduce the following elements:

(μ) $a'_i = g_1^{-1} a_i g_1$.


It is clear that the motion $a'_i$ is equivalent to a motion in which $P_1, P_3, \ldots, P_n$ are fixed, while $P_2$ describes a retrosection $a'_i$ homologous to $a_i$, and that $a'_i$ and $a_j$ intersect in one point only, provided $j > i$ (see figure 3, illustrating the behavior of $a'_2$). If we then consider the direct product of $a'_i$ and $a_j$, regarded as 1-spheres, we have a torus $T$, on which the motions $a_j$ and $a'_i$, regarded as motions of the point pair $P_1, P_2$ (the remaining points $P_3, \ldots, P_n$ being fixed), are represented by two retrosections $\bar{\alpha}$ and $\bar{\alpha}'$ respectively. The common point $\bar{P}$ of $\bar{\alpha}$ and $\bar{\alpha}'$ corresponds to the initial point-pair $(P_1, P_2)$. Let $Q$ be the point at which $a'_i$ and $a_j$ intersect, and let $\bar{Q}$ be the point of the torus which corresponds to the point pair $(Q, Q)$. A closed path on $T - \bar{Q}$ starting from and returning to $\bar{P}$ represents a motion of a variable pair of distinct points starting from and returning to the initial point pair $P_1, P_2$. A deformation of this path on $T - \bar{Q}$ corresponds to an allowable deformation of the corresponding motion on $R$. Since on $T - \bar{Q}$ we have $\bar{\alpha}\,\bar{\alpha}'^{-1}\,\bar{\alpha}^{-1}\,\bar{\alpha}' = \bar{\gamma}$, where $\bar{\gamma}$ is a properly oriented loop issued from $\bar{P}$ and surrounding the point $\bar{Q}$, we have a corresponding relation $a_j a'^{-1}_i a_j^{-1} a'_i = \gamma$, where $\gamma$ is the motion of the variable point pair on $R$ which corresponds to the loop $\bar{\gamma}$ on $T$. To determine $\gamma$, we take as $\bar{\gamma}$ a quadrangle two of whose sides are on the retrosections $\bar{\alpha}$ and $\bar{\alpha}'$ and the other two are parallel to these retrosections (see Fig. 4). The corresponding motion $\gamma$ has now the following description: (a) first the point $P_1$ describes the arc $P_1M$ on $a_j$, $P_2$ is fixed; (b) then $P_2$ describes the arc $P_2N$ on $a'_i$, while the second point is fixed at $M$; (c) a reversal of motion (a); (d) the reversal of motion (b) (see Fig. 3, where $j = 3$, $i = 2$).

Fig. 3.




By letting $M$ approach $P_1$ on $a_j$ and by accompanying this by a deformation of the path (b), we see immediately that the combined motion $\gamma$ can be deformed into a motion in which $P_1$ is fixed and in which $P_2$ turns around $P_1$ in what on Fig. 3 would be the clockwise sense. This motion is visibly equivalent to the motion $g_1^2$. We have therefore the following generating relation:

(ν) $a_j a'^{-1}_i a_j^{-1} a'_i = g_1^2$, $(j > i)$.

We prove in the next section that the relations (α), (β), (γ), (δ), (ε) and (ν) (where the elements $a'_i$ are defined by (μ)) constitute a complete set of generating relations of $G_{n,p}$.

Fig. 4.


4. We denote the abstract group defined by the relations (α)-(ν) of the preceding section by $\bar{G}_{n,p}$, and we use the notations $=$ and $\equiv$ to indicate equality of elements in $G_{n,p}$ and $\bar{G}_{n,p}$ respectively. We wish to prove that $\bar{G}_{n,p}$ coincides with $G_{n,p}$, or what is the same that $\alpha = \beta$ implies $\alpha \equiv \beta$, where $\alpha$ and $\beta$ are products of the generators $a_i$, $g_k$. The proof will be made in several steps.


(a) If $W$ is any product of the generators $a_i$, $g_k$, then $W \equiv g_h \cdots g_1 W_1$, where $0 \le h \le n - 1$ and where $W_1$ involves only the elements $g_2, \ldots, g_{n-1}$, $a_i$, $a'_i$ and the elements




The proof is the same as the one given in Zariski,* pp. 612-613, except that it is also necessary to make use of the relations


The elements $s_j$, denoted in Zariski,* p. 612, by $a_j$, represent motions in which the points $P_2, \ldots, P_n$ are at rest while $P_1$ describes a loop around the point $P_j$.


(b) If $W$ represents a motion in which $P_1$ returns to its initial position, then $W \equiv W_1$.

This is a consequence of (a) since in $g_h \cdots g_1 W_1$ the point $P_{h+1}$ is carried into $P_1$.


(c) The subgroup T of Gus generated by the elements Sn, * * 
is an invariant subgroup of the group generated in Gn» by the above elements 


That $g_k^{\pm 1} s_j g_k^{\mp 1} \subset \Gamma$ for $k = 2, \ldots, n - 1$ has already been proved in Zariski,* p. 613. In view of (δ) it remains to prove that $a'^{\pm 1}_i \Gamma a'^{\mp 1}_i \subset \Gamma$.

The relation shows that a’; is commutative with 
Hence either one of the two relations 


implies the other. Now (e) also implies the relation (a’;)~taja’; = sq a48o. 
Consequently both relations (1) hold true for e=—1. Transforming the 
relation = by (a’;)-? and taking into account the com- 
mutativity of the elements a’;, s.~'a;, we find that belongs tol’. Hence 
the relations (1) hold true for e= + 1. 

Since $g_1 s_j g_1^{-1}$, $j > 2$, involves only the elements $g_2, \ldots, g_{n-1}$, it follows by (δ) that $g_1 s_j g_1^{-1}$ is commutative with each $a_i$, and hence $s_j$ is commutative with $a'_i$ $(= g_1^{-1} a_i g_1)$, for $j > 2$.

It remains to prove that all the transforms (a’;)‘aj(a’4)"~,e = 41,14, 
belong tor. For e=—1 andi < j this follows directly from the relation (vy), 
and for e = + 1 and i < j this is proved by transforming the relation (v) by 
(a’;)-1, since we have already proved that a’;g,7a’; is in’. We now transform 
(v) by g, and we obtain the relation = Since CT, 
fore = + 1, it follows immediately that a’j‘aja’;* C T, and this completes the 
proof of the invariance of the subgroup TI. 

(d) As a consequence of (b) and (c) we may now assert that if $W$ represents a motion in which the point $P_1$ returns to its initial position, then we have already in the abstract group $\bar{G}_{n,p}$ a relation of the form:


where $W_1$ and $W_2$ are products of the elements indicated in the parentheses.


(e) To prove that the group $G_{n,p}$ and the abstract group $\bar{G}_{n,p}$ are identical, we use an induction with respect to $n$, since for $n = 1$ the group $G_{n,p}$ is merely the fundamental group of the Riemann surface $R$, and in this case the relation (γ), where now the left-hand member reduces to 1, is the only generating relation for $G_{1,p}$. Hence $\bar{G}_{1,p}$ coincides with $G_{1,p}$. We shall then assume that $G_{n-1,p}$ and $\bar{G}_{n-1,p}$ are identical groups. For $G_{n-1,p}$ we take as initial set of $n - 1$ points the points $P_2, \ldots, P_n$, and as generators the elements $g_2, \ldots, g_{n-1}$, $a'_1, \ldots, a'_{2p}$. As elements analogous to the $a'_i$ we take the elements $g_2^{-1} a'_i g_2$.

Let $W$ be an element of the abstract group $\bar{G}_{n,p}$, expressed as a product of the generators $g_k$, $a_i$, and let $W = 1$ be a true relation in $G_{n,p}$. Since in the motion $W$ every point $P_i$ returns to its initial position, the representation (2) of $W$ holds true. Since in the motion $W_1$ the point $P_1$ is fixed, while in the motion $W_2$ the points $P_2, \ldots, P_n$ are fixed, it is clear that $W_1 = 1$ must be a true relation in $G_{n-1,p}$. By our induction, this relation must be a consequence of the relations (α)-(ν) for the case $n - 1$. To rewrite these relations for the group $G_{n-1,p}$ we must replace $g_1, \ldots, g_{n-2}$ by $g_2, \ldots, g_{n-1}$ and the elements $a_i$ by the elements $a'_i$. Let us letter these new relations by (α'), (β'), ..., (ν').

We assert that the relations (α'), (β'), (δ'), (ε'), (ν') are group-theoretic consequences of the relations (α), (β), (δ), (ε), (ν). The relations (α'), (β') are among the relations (α), (β). As for the relations (δ'), we observe that by (α), $g_1$ is commutative with $g_k$, $k \ge 3$, and hence the relations (δ') are obtained by transforming by $g_1$ those relations (δ) in which $k \ge 3$. Finally, the relations (ε'), (ν') are the transforms of (ε) and (ν) by $g_2 g_1$. In fact, since $g_2$ and $a_i$ are commutative, we have

$(g_2 g_1)^{-1} a_i (g_2 g_1) = g_1^{-1} a_i g_1 = a'_i$.

Taking into account the relation $g_1 g_2 g_1 = g_2 g_1 g_2$ we find

$(g_2 g_1)^{-1} a'_i (g_2 g_1) = (g_1 g_2)^{-1} a_i (g_1 g_2) = g_2^{-1} a'_i g_2$.

Moreover, we have $(g_2 g_1)^{-1} g_1 (g_2 g_1) = g_2$, as a consequence of the relation

$g_1 g_2 g_1 = g_2 g_1 g_2$.
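The last identity, $(g_2 g_1)^{-1} g_1 (g_2 g_1) = g_2$, can be checked mechanically by word manipulation. The sketch below uses an encoding chosen here for illustration only: the integer $k$ stands for $g_k$ and $-k$ for $g_k^{-1}$. It applies the relation $g_1 g_2 g_1 = g_2 g_1 g_2$ once and then cancels adjacent inverse letters.

```python
def free_reduce(word: list) -> list:
    """Cancel adjacent pairs x, x^{-1} until none remain."""
    out = []
    for x in word:
        if out and out[-1] == -x:
            out.pop()
        else:
            out.append(x)
    return out

def apply_braid_once(word: list) -> list:
    """Replace the first occurrence of g1 g2 g1 by g2 g1 g2."""
    for k in range(len(word) - 2):
        if word[k:k + 3] == [1, 2, 1]:
            return word[:k] + [2, 1, 2] + word[k + 3:]
    return word

# (g2 g1)^{-1} g1 (g2 g1) written out as g1^{-1} g2^{-1} g1 g2 g1
conjugate = [-1, -2, 1, 2, 1]
reduced = free_reduce(apply_braid_once(conjugate))
```

Here `reduced` is the one-letter word `[2]`, i.e. $g_2$, as stated in the text.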
We have left out the relation (γ'), i.e. the following:


This is not a true relation in $G_{n,p}$. In fact, if we transform (γ) by $g_1$, we find the following relation:

$H = g_1^2$; i.e., $H = s_2$.


Having thus proved that all the generating relations of $G_{n-1,p}$, except the relation $H = 1$, are also true relations in the abstract group $\bar{G}_{n,p}$, and recalling that $W_1 = 1$ holds in $G_{n-1,p}$ we deduce that $W_1$, as an element of the group $\bar{G}_{n,p}$, can be expressed as a product of transforms of $H$, i.e. of $s_2$, the transforming elements involving only the generators $g_2, \ldots, g_{n-1}$, $a'_i$ of $G_{n-1,p}$. Hence, by (c), we can write $W_1 \equiv W'_1$, where $W'_1 \subset \Gamma$. In view of (2), we conclude that if a relation $W = 1$ holds true in $G_{n,p}$, then we have in $\bar{G}_{n,p}$ $W \equiv F$, where $F$ is a product involving only the elements $a_i$, $s_j$.


The elements $a_1, \ldots, a_{2p}$, $s_2, \ldots, s_n$ are generators of the fundamental group $G^*$ of the Riemann surface with $n - 1$ holes at $P_2, \ldots, P_n$. It can be proved, as in Zariski,* p. 614, that the relation $F = 1$, true in $G_{n,p}$, implies that $F$, considered as an element of $G^*$, belongs to the center of $G^*$. Since $G^*$ is a free group, for $n \ge 2$, it follows that $F$ is the identity in $G^*$. The only generating relation of $G^*$ is the following:


a,a,7) apt Ae dep == $283° * Sn. 


If the $s_j$'s are replaced by their expressions in terms of the $g_k$'s, this relation coincides with the relation (γ). Hence $F = 1$ is also a true relation in the abstract group $\bar{G}_{n,p}$, i.e. we have $F \equiv 1$. Consequently $W = 1$ implies $W \equiv 1$, q.e.d.


II. On an invariant subgroup of $G_{n,1}$ in the elliptic case.


5. In a motion which carries the initial n-tuple $P_1, \ldots, P_n$ back to its initial position, the paths described by the points $P_1, \ldots, P_n$ constitute together a closed curve, a singular 1-cycle $\sigma_1$. Those elements of $G_{n,p}$ for which this cycle $\sigma_1$ is $\sim 0$ on the Riemann surface $R$ form an invariant subgroup $H_{n,p}$ of $G_{n,p}$, and the quotient group $G_{n,p}/H_{n,p}$ is the homology group of $R$. There is a general procedure, given by Reidemeister,’ for determining the generators and the generating relations (finite or infinite in number) of an invariant subgroup of a discrete infinite group, whether the quotient group is finite or infinite. We shall now apply this general method to the invariant subgroup $H_{n,1}$ of $G_{n,1}$ in the elliptic case $(p = 1)$. It will be seen that $H_{n,1}$ admits a finite set of generators satisfying a finite set of relations, although the quotient group is in this case a free abelian group. This could also be foreseen from the geometric considerations of Part III of this paper, where it will be shown that the group $H_{n,1}$ is the fundamental group of the residual space of a certain elliptic plane curve.

The reduction of the set of generators and of generating relations of $H_{n,1}$, which we are about to undertake, can be extended to the group $H_{n,p}$, $p$ arbitrary, in so far at least as the explicit determination of a finite set of generators is concerned. As for the generating relations, a similar reduction presents some difficulties.

From now on we shall denote the groups $G_{n,1}$ and $H_{n,1}$ by $G_n$ and $H_n$ respectively. $G_n$ is generated by the elements $a_1, a_2, g_1, \ldots, g_{n-1}$. We rewrite the generating relations of $G_n$ as follows:


(Ti, = =1, |i—j| 1; 
= 919i = 1S 
(3) T  =91° * Gn-29?n-19n-2° 9id2 = 1; 7 
PR = 1, (k 
(ag1")? (gia)? = 1, (t= 1,2); 
q = = 1, = 91191. 


Since $a_1$, $a_2$, considered as 1-cycles on $R$, are generators of the homology group, it follows that there is a (1-1) correspondence between the elements $a_1^i a_2^j$ of $G_n$ and the elements of the quotient group $G_n/H_n$. It is also clear that the elements of $H_n$ are those and only those elements of $G_n$ which become equal to 1 if the relations $g_i = 1$, $a_1^{-1} a_2 a_1 a_2^{-1} = 1$ are added to the generating relations of $G_n$, i.e. those power products of the generators $a_1$, $a_2$, $g_k$ in which the sum of the exponents of each of the elements $a_1$, $a_2$ vanishes. It follows, by the quoted paper of Reidemeister, that the following elements are generators of $H_n$:


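(4) $\alpha_{ij} = (a_1^i a_2^j)\,a_1\,(a_1^{i+1} a_2^j)^{-1}$, $\alpha'_{ij} = (a_1^i a_2^j)\,a_2\,(a_1^i a_2^{j+1})^{-1}$,

$g_{ij,k} = (a_1^i a_2^j)\,g_k\,(a_1^i a_2^j)^{-1}$, $(k = 1, \ldots, n - 1)$.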


The elements $\alpha'_{ij}$ are all identically equal to 1, and hence we are left with the elements $\alpha_{ij}$ and $g_{ij,k}$.

Generating relations of $H_n$ are obtained as follows. In the first place a definite construction is given by means of which any power product $\pi$ of the generators of $G_n$ can be expressed in the form $\pi_1 a_1^i a_2^j$, where $\pi_1$ is a power product of the generators of $H_n$. Here the exponents $i$, $j$ are uniquely determined by $\pi$, since $\pi$ and $a_1^i a_2^j$ correspond to one and the same element of the quotient group $G_n/H_n$. This construction is as follows. Let $\pi = \pi' \lambda^{\pm 1}$, where $\lambda$ is a generator of $G_n$ and where $\pi'$ contains less factors than $\pi$, and let us assume that we have already expressed $\pi'$ in the form $\pi'_1 a_1^i a_2^j$. Among the generators (4) of $H_n$ there is an element $\lambda_{ij}$ of the form $(a_1^i a_2^j)\lambda(a_1^{i'} a_2^{j'})^{-1}$ and there is also an element $\lambda_{i''j''}$ of the form $(a_1^{i''} a_2^{j''})\lambda(a_1^i a_2^j)^{-1}$. Then, if $\pi = \pi'\lambda$, we write $\pi = \pi'_1 \lambda_{ij}\, a_1^{i'} a_2^{j'}$, and if $\pi = \pi'\lambda^{-1}$, we write $\pi = \pi'_1 \lambda_{i''j''}^{-1}\, a_1^{i''} a_2^{j''}$.

Using this construction, we obtain all the generating relations of $H_n$ as follows: (a) we express the power products which occur in the relations (3) by means of the generators of $H_n$ and we put equal to 1 the resulting expressions; (b) we apply the same procedure to the transforms of the relations (3) by any of the elements $a_1^i a_2^j$; (c) we finally apply our construction to the elements $a_1^i a_2^j$, getting $a_1^i a_2^j = \pi_{ij}\, a_1^i a_2^j$, where $\pi_{ij}$ is a power product of the generators of $H_n$, and we put $\pi_{ij} = 1$.*
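The rewriting construction described above can be sketched in a few lines. The encoding is an assumption made here for illustration: a word is a list of pairs (generator name, exponent $\pm 1$), the coset of $a_1^i a_2^j$ is the pair `(i, j)`, and an output triple `(name, (i, j), e)` stands for the $e$-th power of the generator of $H_n$ obtained by transforming `name` with $a_1^i a_2^j$. Letters that are identically 1 as words (those coming from $a_2$, and those coming from $a_1$ at a coset with $j = 0$) are dropped.

```python
from typing import List, Tuple

Letter = Tuple[str, int]   # (generator name, exponent +1 or -1)
Coset = Tuple[int, int]    # (i, j) stands for the representative a1^i a2^j

def step(coset: Coset, name: str, exp: int) -> Coset:
    """Coset reached after multiplying on the right by name^exp."""
    i, j = coset
    if name == "a1":
        return (i + exp, j)
    if name == "a2":
        return (i, j + exp)
    return coset           # every g_k maps to the identity of the quotient

def rewrite(word: List[Letter]):
    """Rewrite a word of G_n as (word in generators of H_n, final coset)."""
    coset = (0, 0)
    out = []
    for name, exp in word:
        nxt = step(coset, name, exp)
        # lambda^{+1} is indexed by the current coset, lambda^{-1} by the new one
        index = coset if exp == 1 else nxt
        trivial = name == "a2" or (name == "a1" and index[1] == 0)
        if not trivial:
            out.append((name, index, exp))
        coset = nxt
    return out, coset

conj = rewrite([("a1", 1), ("g1", 1), ("a1", -1)])
comm = rewrite([("a2", 1), ("a1", 1), ("a2", -1), ("a1", -1)])
```

In this sketch `conj` rewrites $a_1 g_1 a_1^{-1}$ as the single transformed generator of $g_1$ at the coset $(1, 0)$, and `comm` rewrites $a_2 a_1 a_2^{-1} a_1^{-1}$ as the single letter of type $a_1$ at the coset $(0, 1)$; in both cases the final coset is $(0, 0)$, so the word lies in $H_n$.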
It is immediately seen that the relations 7;; —1 give only the following 
trivial relations: + 
= 1, for all and 7; 
@ == 1, for all 1. 


The relations merely imply gijx = gx and = 9x%ij 
for all 1, 7 and for k = 2. 

Summing up, we have at present the following generators and generating relations for the group $H_n$:


Generators of $H_n$:
$g_{ij}$, $\alpha_{ij}$ $(i, j = 0, \pm 1, \pm 2, \ldots)$, $g_2, \ldots, g_{n-1}$.


Generating relations of Hn: 


“The relations 7;, = 1 replace in the present case the relations 7,,F,,T,,-1 given 
by Reidemeister,? p. 13. We use the notations of Reidemeister and we prove that, quite 
generally, the ng* relations T,,F,.,T,,-1 = 1 can be replaced by the g relations r,, (S,,) = 1, 
where 7,,(S;;,) is the power product of the S,,’s which we get if we express 7, in the 


m 
form 7,,7,,, according to Reidemeister’s construction. Let us first consider the case in 


which m ¥ g, i.e. T,,, is not the element 1. In this case T,,F,,,7,,-1 contains S87) and 
hence (see Reidemeister,? p. 11) in the course of the construction Pt must be replaced 
by But then we get an expression which can be changed into 7, 
by using the trivial relations S,;8;-1 1. Hence the relation 7,,F,,T,, = 1, expressed 
in terms of §,,’s, can be changed into the relation 7,,,7,,,-1 = 1 by using the trivial rela- 
= 1. Now it is immediately seen that this last relation coincides with the 


tions 8,87) = 
relation 7,-1(S,,.) =1. Let now m=g. In this case we have the generating relation 
F,,=1; but F,,, expressed in terms of the S,,’s has the following form: 

-1 


and hence the relations F,,, = 1 are consequences of the relations 7, = 1. 


t If, for instance, i=0 and j=0, then m,, ja 


= 
0 




(5) (ay 'a24) (a1 1, ) 
(k,l —=1,2,- -,n—1,k 41) 

(5) TE) — (a,tas)T =1, 


-(1,7 = 0, +1, + 2,- 


=1, 2); 
(5”’) = (4, = 1 J 
(5a) Gio = 1, (t—=0,+ 1,+2,-- 


(5b) = (K=2). 


6. The expression of the elements in terms of 
kl ? ? 


the generators of H» leads in a straightforward manner to the following 


generating relations (where i,j =0,+1,+ 2,: 

(6) PEP = 91945, >; 

(6”) 94592Gis = 

(6’”) Gu Je = JkJk+15 

(9) ; 94,539, 1. 


The relations (8) imply that 9;,;.:9:,; is independent of j. Let for brevity, 


(10) 94,5419i,4 = 


The recurrence relations (9) allow us to express all a,;’s in terms of the 
Japs, %,o. Taking into account (5a) and (10) we find 


+1? 


Substituting these expressions of the «;;’s into the relations (7) and taking 
into account (10) we find in a straightforward manner that the relations (7) 
can be replaced by the following relations: 


9419 441,19 441,094,092 = 1, 


We have thus obtained a first reduction of the algebraic expression of the group 
H,,: as generators of Hn we have the elements gij;, (1,7 =0, +1, +2,° °°), 
* the generating relations are (5b), (6)-(6”"), (7), (81), (82), 
where the elements a; in (8,) are defined by (9’), and (10). 

Since = 1, the relation (8,) for = 0 yields the following relation: 




(11) 94+1,094,0 = J4,094-1,05 


Since, by (7), the product 9i,19is1,19i+1,09i,0 is independent of i, we deduce, 
as a consequence of (11), the following relation: 


(11’) 94,19 = (a ae Qc 1, 


We proceed to prove that in the reduced complete set of generating relations of $H_n$ given above, the infinite set of relations $(8_1)$ can be replaced by the relations (11) and (11'), i.e. the relations $(8_1)$ are group-theoretic consequences of the remaining generating relations and of (11) and (11').


Proof. We denote by the product gz: Yn-29?n-19n-2° OF, in 
view of (7), 
(12) T= (9119 141,19 i+1,09i,0) 
We have 
T* = = 819 io 

hence 

Substituting into (9’) we get 

= gis (78:1) 
and hence, using a new symbol fi; for the transforming elements in (81), 


we have 


(13) Big = = 945 (781) 

By (5b) each element is commutative with 7, since r= g?n-1° * * Joy 
hence + is also commutative with Hi jHr i.e. in view of (7) (which is a con- 
sequence of (7’) and (9’)), 7 is commutative with gijrgi;: 

(14) (79:3)? = (gist). 


We use the following relations: 
(15) (781) 9755 [by (10)] 


(15’) 784 (791) (78i41) = T9127 Jis [by (10), (12) and (14) ] 
(15) 784 == (74,5417 ) TS: [ by (10) and (14) ]. 


To prove (15’) and (15”) we proceed as follows: 


SiTgist = [by (10) ] = gist gas; 
hence 


and 


From (15), (15’), and (15”) we deduce for any integer k = 0 the following 
relations: 


In an exactly similar manner the following relations can be verified: 
(r8i+1) Jio(75i) = 
TSi = TJioTGJi,-1- 
From these relations and from (15”) we deduce for any integer k < 0 the 
following relations: 
(17) Yi,2k (78; ) (78i+1) 94.07 I 403 


Using the relations (16), (16’), (17) and (1%) and taking into account the 
relations (8,) and (14), we obtain easily the following relations: 


(18’) Vi, Vi, 2ke1 
We put 


= 
so that, by (13), 
Sin, 


Ji, ok+1 i,2k+194,ke1- 


In view of (18) and (18’), the relations (8,) will follow at once, if we estab- 
lish the following relations: 


(19) iodik = Ji+2, 2k, 
(19’) Gist = Jis2,2k-1, 


Since dio and = Sin J the relation (19) for k =0 
coincides with (11) and the relaticns (19’) for k =1 coincides with (11’). 
Hence in order to establish the relations (19) and (19’) for all & it is sufficient 
to show that if they hold true for a given k, they also hold true for k + 1 and 


for k—1. Now 


84, 8ikJi+1,0 (9 i+1,191+2,19 i+2,09i+1,0) Ji+1,1 


— Six (Gi+2,19i+2,0) = 




Hence, assuming (19) and (19’) for a given value of k, the same relations 
follow also for k + 1 and k —1 in view of the relations 
442,58) = GJi+2,j-2e, e=it1 


which are direct consequences of the relations (8,), q. e. d. 


7. We now complete the elimination of the elements $\alpha_{ij}$ from the generating relations of $H_n$ by proving that also the commutativity relations $\alpha_{ij} g_k = g_k \alpha_{ij}$, $k \ge 2$, (5b), are consequences of the remaining relations, to wit, of the relations (6), (6''), $(8_2)$, and (7''). This is obvious if $k \ge 3$, since the $\alpha_{ij}$ depend only on the $g_{ij}$'s [see (9')] and since, by (6), the $g_{ij}$'s are commutative with $g_3, \ldots, g_{n-1}$. By (7), which is a consequence of (10) and of the relations (9') which define the elements $\alpha_{ij}$, we have


Since #9 = 1, it is sufficient to establish the commutativity of Hija and ge. 
Now, using the relations (6) and (6””) we find: 


and this proves our assertion.
Summing up the reduction carried out so far, our group $H_n$ is defined by the following set of generators and generating relations:


Generators: 
(20) (4,7 =0,+1, + 5 Yn-1- 


Generating relations: 
(22) Ji+2,0 = 94+1,094,09 
(22,) Ji+2,1 = 


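(23) $g_{ij} g_2 g_{ij} = g_2 g_{ij} g_2$


(23_1) $g_{ij} g_k = g_k g_{ij}$ $(k = 3, \ldots, n - 1)$
(24) $g_k g_{k+1} g_k = g_{k+1} g_k g_{k+1}$ $(k = 2, 3, \ldots, n - 2)$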
(24:) 


(25) 9419 141,19 i+1,09i,0J2° Jn-29?n-19n-2 




The existence of a finite set of generators follows now readily. In fact, the relations (21), for a fixed value of $i$, can be considered as recurrence relations which define the elements $g_{ij}$ in terms of the two free elements $g_{i0}$ and $g_{i1}$. Then the relations (22) and $(22_1)$ can be used in order to express all the elements $g_{i0}$ and $g_{i1}$ in terms of $g_{00}$, $g_{10}$ and $g_{01}$, $g_{11}$, respectively. Consequently our group $H_n$ is generated by the $n + 2$ elements:

(26) $g_{00}$, $g_{01}$, $g_{10}$, $g_{11}$, $g_2, \ldots, g_{n-1}$.

The reduction of the infinite set of relations $(23_1)$ is trivial: since all the $g_{ij}$'s are expressible in terms of $g_{00}$, $g_{10}$, $g_{01}$, $g_{11}$, all the relations $(23_1)$ are consequences of the four relations

$(23_2)$ $g_{ij} g_k = g_k g_{ij}$ $(i, j = 0, 1)$.
The reduction of the relations (23) is based on the following 
Lemma. If four elements a, b, c, x satisfy the relations 


and if d = cbc‘, then the above relations have as a group-theoretic consequence 
the relation $xdx = dxd$.


Proof. Lete—=-+1. Then 


bexeb = ba cab" = bab abrb = 2b 
= a = x 


Hence 
The case $\epsilon = -1$ is reducible to the case $\epsilon = +1$, by the substitution
Putting $a = g_{ij}$, $b = g_{i,j+1}$, $c = g_{i,j+2}$, $\epsilon = -1$, we deduce from the above
lemma, in view of (21), that for a fixed i, the relations (23) relative to the
indices j, j + 1, j + 2 imply as a group-theoretic consequence the relation (23)
for the index j + 3. Similarly, if we put $a = g_{i,j+2}$, $b = g_{i,j+1}$, $c = g_{ij}$, $\epsilon = 1$,
we find that the three consecutive relations (23) just mentioned also imply the
relation (23) for the index j - 1. It follows that, for a fixed i, all the relations
(23) are consequences of any three of them relative to three consecutive indices,
say j = 0, 1, 2:




THE TOPOLOGICAL DISCRIMINANT GROUP OF A RIEMANN SURFACE. 351 


(27) $g_{ij}\, g_2\, g_{ij} = g_2\, g_{ij}\, g_2$ $(j = 0, 1, 2)$.


Now, in view of (22) and (22₁), we conclude in a similar manner, on the basis
of the preceding lemma, that for j = 0, 1 the relations $g_{ij}\, g_2\, g_{ij} = g_2\, g_{ij}\, g_2$ are
consequences of three of these relations relative to three consecutive values of i,
say i = 0, 1, 2. It remains to consider the set of relations $g_{i2}\, g_2\, g_{i2} = g_2\, g_{i2}\, g_2$.
Using the relations (27) for j = 0, 1 and the expression of $g_{i2}$ derived from
(21), for j = 0, we change the above relation into an equivalent relation as
follows:


94292912929 i292 = 9419 109 119.29 119 40911 109 11 
= Ji19i0G 29 109.29 419 29 109 = (GisGiog2)? (G29 irgio) 


Hence the relation $g_{i2}\, g_2\, g_{i2} = g_2\, g_{i2}\, g_2$ can be replaced by the relation


(28) $(g_{i1}\, g_{i0}\, g_2)^2 = (g_2\, g_{i1}\, g_{i0})^2$.


Now, it is not difficult to see that the expressions $\sigma_i = (g_{i1}\, g_{i0}\, g_2)^2 (g_2\, g_{i1}\, g_{i0})^{-2}$
are all transforms of each other, for i = 0, ±1, ±2, ..., as a consequence
of the relations (27) (j = 0, 1), (23₁) and (25). In fact, let


so that, by (23₁), we have $g_{ij}\,\delta = \delta\, g_{ij}$. By (25), we have
91+1,19i+1,0 = 


Hence, substituting into $\sigma_{i+1}$, we find


= 951928 910g 2941 1089 29 11929 28929 i192 
= (gogirgio) * 11892g 41 


where = obviously a transform of oi. 

Hence, any one of the relations (28) implies as a consequence the entire
set of these relations. We shall take the relation relative to i = 0.

We finally observe that, in view of the relations $g_{i,1}\, g_{i+1,1} = g_{i+1,1}\, g_{i+2,1}$ and
$g_{i+1,0}\, g_{i,0} = g_{i+2,0}\, g_{i+1,0}$ [(22₁) and (22) respectively], the infinite set of relations (25) reduces to one relation, say relative to i = 0.

Summing up, we have the following result:


The group $H_n$ is defined by the following set of generators and generating
relations:




1. Generators: 


2. Generating relations: 


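(30) $g_{ij}\, g_2\, g_{ij} = g_2\, g_{ij}\, g_2$ $(i, j = 0, 1)$.
(30') $(g_{10}\, g_{00}\, g_2)^2 = (g_2\, g_{10}\, g_{00})^2$
(30'') $(g_{01}\, g_{11}\, g_2)^2 = (g_2\, g_{01}\, g_{11})^2$
(30''') $(g_{01}\, g_{00}\, g_2)^2 = (g_2\, g_{01}\, g_{00})^2$


(31) $g_{ij}\, g_k = g_k\, g_{ij}$ $(i, j = 0, 1;\ k = 3, \dots, n-1)$


(32) $g_k\, g_{k+1}\, g_k = g_{k+1}\, g_k\, g_{k+1}$ $(k = 2, 3, \dots, n-2)$


(32') $g_k\, g_l = g_l\, g_k$, $|k - l| \neq 1$
(33) $g_{01}\, g_{11}\, g_{10}\, g_{00}\, g_2 \cdots g_{n-2}\, g_{n-1}^2\, g_{n-2} \cdots g_2 = 1$.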


The relations (30'), (30'') and (30''') are the relations (27) relative to the
following values of the indices i, j: i = 2, j = 0; i = 2, j = 1; i = 0, j = 2,
after the expressions of $g_{20}$, $g_{21}$, $g_{02}$, given by (22), (22₁) and (21) respectively,
are substituted.


Remark. If we change our notation as follows:
(34) $g_{01} = A_1$, $g_{11} = A_2$, $g_{10} = A_3$, $g_{00} = A_4$;
then we see that the relations (30'), (30''), (30''') are of the form
(35) $(g_2 A_i A_j)^2 = (A_i A_j g_2)^2$, i, j = 1, 2, 3, 4.


An easy verification shows that the six relations (35) all hold true. For
i = 3, j = 4; i = 1, j = 2; i = 1, j = 4 they coincide with the relations
(30'), (30''), (30''') respectively. For i = 2, j = 3 the relation (35) coincides
with (28) for i = 1. The reader will easily verify the truth of the relations
(35) in the remaining two cases.


III. The fundamental group of plane elliptic curves. 


8. What is the geometric significance of the invariant subgroup $H_n$ of
$G_n$? We proceed to show that $H_n$ is the fundamental group of a certain subspace of $R^n - D$ (see section 1).

Let us consider R as the Riemann surface of some algebraic elliptic curve f.
This curve carries, for every n, a simple infinity of complete linear series $g_n^{n-1}$
of dimension n - 1, and each set of n points of f belongs to one and only




one series $g_n^{n-1}$. The simple infinity of these series is an elliptic one-dimensional
variety, birationally equivalent to f: in fact, each series contains a unique set
of n points of which n - 1 are preassigned fixed points $P_1^0, \dots, P_{n-1}^0$, and
then the n-th point P of the set determines the series uniquely.

Since a $g_n^{n-1}$ is represented on $R^n$ by a space homeomorphic to a linear
space of n - 1 dimensions, we conclude that the algebraic variety $R^n$ contains
an elliptic pencil $\{S_{n-1}\}$ of linear (n - 1)-spaces $S_{n-1}$, free from base points.

Let $V_{n-2}$ be the intersection of an $S_{n-1}$ with the discriminant variety D
of $R^n$. We assert that $H_n$ is the fundamental group of the residual space


$S_{n-1} - V_{n-2}$.


Proof. We take the origin O of the fundamental group to be a point of
$S_{n-1} - V_{n-2}$. We have to prove that: (1) a singular 1-sphere $\gamma$ on O and in
$R^n - D$ represents an element of $H_n$, if and only if it can be deformed over
$R^n - D$ into a 1-sphere $\gamma'$ contained in $S_{n-1} - V_{n-2}$, the point O being fixed;
(2) if $\gamma$ is already in $S_{n-1} - V_{n-2}$ and if it bounds a singular 2-cell on $R^n - D$,
then it also bounds a singular 2-cell on $S_{n-1} - V_{n-2}$.

Let u be an elliptic integral of the first kind attached to the curve f.
It is well known that f admits a continuous one-parameter group of birational
transformations $\pi_t$ into itself, represented analytically by the equation
$u' = u + t$ (mod periods). Each transformation $\pi_t$ of the group is an automorphism of R. There is an induced automorphism of $R^n$, which we shall
also denote by $\pi_t$ and which is at the same time an automorphism of the
residual space $R^n - D$, since $\pi_t$ transforms sets of n distinct points of R into
sets of n distinct points. Each $\pi_t$ permutes the linear spaces $S_{n-1}$ (images of
linear series on f) and induces a birational transformation $\sigma_\tau$ into itself (an
automorphism) of the elliptic pencil $\{S_{n-1}\}$. If we put


$v = u(x_1) + u(x_2) + \cdots + u(x_n)$,


where $x_1, \dots, x_n$ is an n-tuple of points of R, then v is a simple integral
attached to $R^n$ and v reduces to a constant on each member of the pencil $\{S_{n-1}\}$
(theorem of Abel). Hence v is also an elliptic integral of the first kind attached
to the pencil $\{S_{n-1}\}$, and the transformation $\sigma_\tau$ is given by the equation
$v' = v + \tau$, $\tau = nt$.


The group of the transformations $\pi_t$ covers $n^2$ times the group of transformations $\sigma_\tau$, since to the identity $\sigma_0$ correspond the $n^2$ transformations $\pi_{\omega/n}$, where
$\omega/n$ is the n-th part of a period of u. Since the covering is free from branch points,
it follows immediately that any variation of $S_{n-1}$ in the pencil $\{S_{n-1}\}$ can be




accompanied by an isotopic deformation of the variable $S_{n-1}$, and in such a
manner that also the residual space is deformed isotopically. This isotopic
deformation of $S_{n-1} - V_{n-2}$ is simply effected by a convenient chain of transformations $\pi_t$, $0 \le t \le 1$, applied to the initial position of the $S_{n-1}$. From
this last statement we derive immediately the following conclusion. Let R' be
the Riemann surface of the elliptic pencil $\{S_{n-1}\}$. Every point P of $R^n$ lies
on a definite $S_{n-1}$ of the pencil and is thus mapped upon a definite point P'
of R'. Similarly any point set A on $R^n$ is mapped upon a point set A' of R'.
A deformation of A on $R^n$ or on $R^n - D$ induces a deformation of A' on R'.
From the preceding statement we may conclude that, conversely, if A is on
$R^n - D$, then any deformation of A' on R' is induced by a deformation of A
on $R^n - D$, and that if a point P of A' is fixed throughout the deformation
of A', then the points of A which are mapped upon P may also be assumed
to be fixed during the deformation of A.

Let $\gamma$ be a singular 1-sphere on $R^n - D$ issued from the origin O of the
group $G_n$. From the definition of the group $H_n$ it follows that $\gamma$ represents an
element of $H_n$ if and only if the map $\gamma'$ of $\gamma$ on R' is a (singular) 1-cycle ~ 0.
Assume that $\gamma' \sim 0$. Then $\gamma'$ can be contracted on R' to the point O', image
of O, and hence, by our preceding result, $\gamma$ can be deformed into a 1-sphere $\gamma_1$
contained in $S_{n-1} - V_{n-2}$, the point O being fixed. Conversely, if $\gamma$ can be
deformed into such a 1-sphere $\gamma_1$, then $\gamma'$ can be contracted to the point O',
$\gamma' \sim 0$, and hence $\gamma$ represents an element of $H_n$.

Assume that $\gamma$ is in $S_{n-1} - V_{n-2}$ and that it bounds a (singular) 2-cell $E_2$
on $R^n - D$. The map of $E_2$ on R' is a 2-sphere M' containing O', since the
boundary of $E_2$ is mapped on the point O'. On R' any 2-sphere on O' can be
contracted to the point O', this last point being fixed. Hence $E_2$ can be
deformed on $R^n - D$ into a 2-cell contained in $S_{n-1} - V_{n-2}$, the boundary $\gamma$
being fixed. This completes the proof of our theorem.


9. The variety $V_{n-2}$ is a hypersurface immersed in the (n - 1)-space
$S_{n-1}$. Let $S_2$ be a general plane of $S_{n-1}$ and let C be the plane algebraic curve
along which $S_2$ cuts $V_{n-2}$. By a theorem proved in Zariski,⁵ the fundamental
group $H_n$ of the residual space $S_{n-1} - V_{n-2}$ coincides with the fundamental
group of $S_2 - C$. Now, a general plane $S_2$ in our $S_{n-1}$ is the image of a general
series $g_n^2$ immersed in the corresponding $g_n^{n-1}$ of the elliptic curve. A point
of $S_2$ represents a set of the series $g_n^2$. If we refer the sets of the $g_n^2$ to the
lines of a plane, we obtain the general plane elliptic curve $\Gamma$, of order n,
a birational transform of f, on which the sets of the $g_n^2$ are cut out by the lines
of the plane. The points of $S_2$ which are on C represent n-tuples of the $g_n^2$




with coincident points, hence correspond to the tangent lines of $\Gamma$. We conclude that the plane curve C is the dual of a general elliptic plane curve $\Gamma$
of order n, and that our group $H_n$ is the fundamental group of the residual
space of C.

The curve C is of order 2n, possesses k = 3n cusps and d = 2n(n - 3)
nodes, and is the maximal cuspidal elliptic curve of its order. Of the
generating relations of $H_n$, the relations (30), (30'), (30''), (30'''), and (32)
are all of the form aba = bab and arise from the cusps of C. The commutativity relations (31) and (32') are due to the nodes of C. Finally, the
relation (33) corresponds to the relation $A_1 A_2 \cdots A_{2n} = 1$, where the $A_i$
are loops contained in a general line of the plane of C and surrounding the
2n intersections of this line with C (see Zariski⁴). The relations $A_5 = A_{2n}$,
$A_6 = A_{2n-1}$, etc., arise from the n tangent lines of C belonging to a pencil of
lines (compare the analogous discussion of the singularities and of the corresponding generating relations in Zariski,⁴ p. 615). By watching the effect
which the removal of a cusp or of a node has upon the fundamental group,
we arrive at conclusions relative to the fundamental group of any plane curve
C' which admits C as a limiting case, in particular of any plane elliptic curve
of even order 2n, possessing only nodes and cusps (compare Zariski,⁴ p. 616).
Let us first remove a node, i.e. let us consider a node of C as virtually non-existent. For the fundamental group this amounts to replacing a commutativity relation ab = ba by the relation a = b. We may assume that the
relation thus affected is the relation $g_2 g_4 = g_4 g_2$. We have then $g_4 = g_2$.
Since $g_2 g_5 = g_5 g_2$ and $g_4 g_5 g_4 = g_5 g_4 g_5$, it follows $g_4 = g_5$. In a similar manner
we find $g_2 = g_3 = g_4 = \cdots = g_{n-1}$. Since $g_{ij}\, g_2\, g_{ij} = g_2\, g_{ij}\, g_2$ and $g_{ij}\, g_3 = g_3\, g_{ij}$,
the relation $g_2 = g_3$ implies the relation $g_{ij} = g_2$. Hence the fundamental
group becomes a cyclic group of order 2n. Let us now remove a cusp, by
converting the cusp into a node. A relation aba = bab will be affected and
will have to be replaced by a = b. We may assume that the affected relation
is the relation $g_{00}\, g_2\, g_{00} = g_2\, g_{00}\, g_2$. We have then $g_2 = g_{00}$, after the cusp has
been removed. If $n \ge 4$, we may use the relations $g_{00}\, g_3 = g_3\, g_{00}$, $g_2 g_3 g_2 = g_3 g_2 g_3$
and we then find that $g_2 = g_3$. We conclude as before that the group becomes
cyclic. Hence, we have the following result: If C' is a plane curve of order
2n > 6 with nodes and cusps, and if C' admits the maximal cuspidal elliptic
curve C, of the same order, as a limiting case, without being a curve C itself,
then the fundamental group of C' is cyclic (of order 2n). In particular, every
plane elliptic curve of even order 2n possessing less than 3n cusps has a cyclic
fundamental group.
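
The numerical characters quoted above for C (order 2n, 3n cusps, 2n(n - 3) nodes) can be cross-checked against the classical Plücker and genus formulas. The following is only a sketch under the assumption that $\Gamma$ is a general plane curve of order n and genus 1 whose only singularities are nodes; the function name `dual_elliptic_invariants` is illustrative and does not come from the paper.

```python
def dual_elliptic_invariants(n):
    """Order, cusps and nodes of the dual of a general plane elliptic curve.

    Assumption (not taken from the paper's text): the curve Gamma of order n
    has genus 1 and only ordinary nodes as singularities.
    """
    # Genus formula for a nodal plane curve: (n-1)(n-2)/2 - nodes = 1.
    nodes_gamma = (n - 1) * (n - 2) // 2 - 1
    # Pluecker: class of Gamma = order of the dual curve C.
    order_dual = n * (n - 1) - 2 * nodes_gamma
    # Pluecker: flexes of Gamma = cusps of the dual curve C.
    cusps_dual = 3 * n * (n - 2) - 6 * nodes_gamma
    # The dual curve has the same genus 1, which fixes its number of nodes.
    nodes_dual = (order_dual - 1) * (order_dual - 2) // 2 - cusps_dual - 1
    return order_dual, cusps_dual, nodes_dual


if __name__ == "__main__":
    for n in range(3, 8):
        print(n, dual_elliptic_invariants(n))
```

For n = 3 the sketch returns a sextic with 9 cusps and no nodes, which is the exceptional case treated next.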
In the exceptional case n = 3 we are dealing with the dual of a general 




plane cubic, i.e. with an elliptic sextic having 9 cusps. We write the generating
relations of $H_3$, using (27) instead of the equivalent set of relations (30)-(30'''):

(34) $g_{ij}\, g_2\, g_{ij} = g_2\, g_{ij}\, g_2$ $(i, j = 0, 1, 2)$.
(35) $g_{01}\, g_{11}\, g_{10}\, g_{00}\, g_2^2 = 1$,

where 


i093 
(36) | J2 = 9109009 54 
= 9519019 11- 


The 9 relations (34) are typical cuspidal relations, and one may conjecture
that they correspond to the 9 cusps of the curve. However, since only 7 of
these relations are group-theoretically independent, this conjecture requires
proof. The 7 independent relations are given by (30), (30'), (30''), (30''')
and correspond to the following values of i, j:


i, j = 0, 1; i = 2, j = 0; i = 2, j = 1; i = 0, j = 2.


At present we can only assert that the seven independent relations are relations
at 7 of the cusps. We recall that the well known group of 9 flexes of a cubic
curve is doubly transitive. Hence if we remove a certain number of cusps of C,
it is immaterial which cusps are removed, as long as the number of removed
cusps does not exceed 2.

We remove successively the two cusps which give rise to the relations
$g_{00}\, g_2\, g_{00} = g_2\, g_{00}\, g_2$, $g_{10}\, g_2\, g_{10} = g_2\, g_{10}\, g_2$. After the removal of the first cusp we
have $g_{00} = g_2$. The relations (30') and (30''') become then consequences of
the relations (30) for i = 1, j = 0 and i = 0, j = 1 respectively, while the
relation (30'') becomes a consequence of (33). Hence the fundamental group
of a sextic with 8 cusps (and with one or no double points) is generated by
4 elements

$g_{01}, g_{11}, g_{10}, g_2$

satisfying the relations:

$g_{ij}\, g_2\, g_{ij} = g_2\, g_{ij}\, g_2$ for $g_{ij} = g_{01}, g_{11}, g_{10}$,
$g_{01}\, g_{11}\, g_{10}\, g_2^3 = 1$.


If we now remove the second cusp, we get $g_{10} = g_2$, and thus the fundamental
group of a sextic with 7 cusps and of genus = 1 is generated by 3 elements


$g_{01}, g_{11}, g_2$




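satisfying the 3 relations:

$g_{01}\, g_2\, g_{01} = g_2\, g_{01}\, g_2$,
$g_{11}\, g_2\, g_{11} = g_2\, g_{11}\, g_2$,
$g_{01}\, g_{11}\, g_2^4 = 1$.


This is also a group generated by the two elements


$u = g_{11}\, g_2\, g_{11}$, $v = g_{11}\, g_2$


satisfying the relations $u^2 = v^3 = 1$.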
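
As an illustration only, the cuspidal relation and the identities for u and v can be observed in a concrete model. The sketch below assumes the two integer matrices `A` and `B` (a choice made here, not taken from the paper) as stand-ins for $g_{11}$ and $g_2$. They satisfy ABA = BAB, and the squares and cubes in question equal -I, which is the identity once matrices are taken up to sign. The third relation $g_{01} g_{11} g_2^4 = 1$ is not modelled, so this is a consistency check rather than a proof of the presentation.

```python
def matmul(p, q):
    """Product of two 2x2 integer matrices given as nested tuples."""
    return tuple(
        tuple(sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


# Hypothetical stand-ins for g_11 and g_2 (an assumption of this sketch).
A = ((1, 1), (0, 1))
B = ((1, 0), (-1, 1))
MINUS_I = ((-1, 0), (0, -1))

ABA = matmul(matmul(A, B), A)
BAB = matmul(matmul(B, A), B)

U = ABA            # plays the role of u = g_11 g_2 g_11
V = matmul(A, B)   # plays the role of v = g_11 g_2

U_SQUARED = matmul(U, U)
V_CUBED = matmul(matmul(V, V), V)

if __name__ == "__main__":
    print("cuspidal relation holds:", ABA == BAB)
    print("u^2 =", U_SQUARED)
    print("v^3 =", V_CUBED)
```

The same identity follows formally from aba = bab alone, since $(ab)^3 = (aba)(bab) = (aba)^2$.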


It is known that this group is also the fundamental group of a sextic
with six cusps on a conic (Zariski⁴). Hence there must be among the seven
cusps left, a third cusp whose removal has no effect on the fundamental group.
It is easily seen, by using the relations (36), that if the removal of an additional cusp yields the relation $g_{ij} = g_2$, $i \neq 2$, $j \neq 0$, the group becomes cyclic.
On the contrary, if the removed cusp gave rise originally to the relation
$g_{20}\, g_2\, g_{20} = g_2\, g_{20}\, g_2$, the group is unaltered, since the relation $g_{20} = g_2$ is already
implied, in view of (36), by the removal of the first two cusps ($g_{00} = g_{10} = g_2$).

It is known that the sextics with six cusps distribute themselves into two
distinct continuous systems, according as the six cusps lie or do not lie on a
conic. The preceding considerations lead therefore to the conclusion that the
fundamental group of a sextic with six cusps not on a conic is cyclic (of
period 6).

One can verify the following: if $g_{i_1 j_1}$, $g_{i_2 j_2}$, $g_{i_3 j_3}$ are any 3 of our nine
elements $g_{ij}$ such that $i_1 + i_2 + i_3 \equiv j_1 + j_2 + j_3 \equiv 0 \pmod 3$, then the removal
of the three corresponding cusps (whence the addition of the relations
$g_{i_1 j_1} = g_{i_2 j_2} = g_{i_3 j_3} = g_2$) leads to a curve whose fundamental group is the
above mentioned group of a sextic with six cusps on a conic. If, however, the
above congruences do not hold true simultaneously, then the removal of the
corresponding cusps leads to a curve with a cyclic fundamental group. What
we have here is obviously something which adds topological significance to the
configuration of the 12 MacLaurin lines determined by the nine flexes of a
cubic curve. It is known that if the nine flexes are distributed into three
triples lying on three MacLaurin lines, then the six flex tangents of any two
of the triples lie on a line conic. Dually, any two of the corresponding triples
of cusps lie on a conic. If then the three cusps of one triple are considered as
virtually non-existent, the resulting sextic must have six cusps on a conic.

This proves incidentally, that the nine relations (34) reproduce exactly 
the relations at the nine cusps. 
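
The congruence condition on index triples can be enumerated directly. The sketch below assumes only that the nine cusps are labelled by pairs (i, j) with i, j taken modulo 3, as in the relations (34); the variable names are illustrative. It lists the triples of distinct labels whose index sums both vanish modulo 3 and counts how many of them pass through each label.

```python
from itertools import combinations, product

# Labels (i, j), i, j = 0, 1, 2, of the nine elements g_ij.
points = list(product(range(3), repeat=2))

# Triples of distinct labels with i1+i2+i3 = j1+j2+j3 = 0 (mod 3).
triples = [
    t for t in combinations(points, 3)
    if sum(p[0] for p in t) % 3 == 0 and sum(p[1] for p in t) % 3 == 0
]

# Number of admissible triples through each label.
triples_through = {p: sum(p in t for t in triples) for p in points}

if __name__ == "__main__":
    print(len(triples), "admissible triples")
    for t in triples:
        print(t)
```

The enumeration gives twelve triples, four through each label, matching the count of MacLaurin lines quoted in the text. The triple (0,0), (1,0), (2,0) is among them, in agreement with the three cusps $g_{00}$, $g_{10}$, $g_{20}$ removed above.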




10. We conclude by pointing out that the reasoning employed in our
paper,⁴ section 7, can be applied also in the present case to elliptic curves of
odd order and leads to the conclusion that the fundamental group of such a
curve (with nodes and cusps) is always cyclic. For the proof it is sufficient
to consider the maximal cuspidal elliptic curve $C_{2n+1}$, of order 2n + 1, and to
observe that $C_{2n+1}$ can be degenerated into the maximal cuspidal elliptic curve
$C_{2n}$ and into a line p tangent to $C_{2n}$. The fundamental group of $C_{2n} + p$ can
be obtained from the fundamental group $H_n$ of $C_{2n}$ by adding an extra generator
y and the relations (yg2)* = (gzy)*, = = guy, k > 2. We obtain 
the curve $C_{2n+1}$ by considering the tacnode of $C_{2n} + p$ at the point of tangency
of p, as a virtual cusp. As a consequence, we replace the relation $(y g_2)^2 = (g_2 y)^2$
by the relation gy, and from this follows immediately that the group of
$C_{2n+1}$ is cyclic.


THE JOHNS HOPKINS UNIVERSITY. 


REFERENCES. 


¹ K. Reidemeister, “Knoten und Gruppen,” Abhandlungen aus dem Mathematischen
Seminar der Hamburgischen Universität, vol. 5 (1927).

² O. Zariski, “On the problem of existence of algebraic functions of two variables
possessing a given branch curve,” American Journal of Mathematics, vol. 51 (1929).

³ O. Zariski, “A topological proof of the Riemann-Roch theorem on an algebraic
curve,” American Journal of Mathematics, vol. 58 (1936).

⁴ O. Zariski, “On the Poincaré group of rational plane curves,” American Journal
of Mathematics, vol. 58 (1936).

⁵ O. Zariski, “A theorem on the Poincaré group of an algebraic hypersurface,”
Annals of Mathematics, vol. 38 (1937).




ON THOSE POINTS OF AN ALGEBRAIC MANIFOLD NOT
REACHABLE BY A GIVEN PARAMETRIC REPRESENTATION.¹


By J. F. DALY.


Let K denote the complex field, and let $x_1, \dots, x_n$ be elements of an
algebraic extension $K(t_1, \dots, t_m, x_1, \dots, x_n)$ of the pure transcendental
field $K(t_1, \dots, t_m)$. A set of complex numbers $t'_1, \dots, t'_m, x'_1, \dots, x'_n$
will be called allowable if every polynomial F(t, x) which vanishes as an
element of $K(t_1, \dots, t_m, x_1, \dots, x_n)$ vanishes also when $t'_i$, $x'_j$ are substituted for $t_i$, $x_j$ respectively. The totality of points whose coordinates
$\{x'_1, \dots, x'_n\}$ belong to allowable sets will be contained in some smallest
algebraic manifold M. This manifold is said to be represented parametrically
in terms of the t's.

In general M will contain points which are not allowable. But in case
the parameters are merely some r of the coordinates themselves it follows
readily from a theorem of Ritt (1) that such exceptional points are always
limit points of allowable points. It is the purpose of the present paper to
extend the above result to all representations, whether the parameters t are
essential or not.


THEOREM. If an algebraic manifold M is represented in terms of any
parameters whatever, the base field K being the complex field, then each point
of M is a limit of allowable points.
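
A familiar instance, chosen here for illustration and not taken from the paper, is the rational parametrization of the circle $x^2 + y^2 = 1$ by $x = (1 - t^2)/(1 + t^2)$, $y = 2t/(1 + t^2)$, with m = 1 and n = 2 in the notation above. The point (-1, 0) lies on the manifold but belongs to no allowable set, since $x(1 + t^2) = 1 - t^2$ would force -1 = 1 there. The sketch below uses exact rational arithmetic to show allowable points approaching it as t grows; the helper names are hypothetical.

```python
from fractions import Fraction


def circle_point(t):
    """Point of the circle reached by the parameter value t (exact arithmetic)."""
    t = Fraction(t)
    denom = 1 + t * t
    return ((1 - t * t) / denom, 2 * t / denom)


def on_circle(point):
    """Check the defining equation x^2 + y^2 = 1 exactly."""
    x, y = point
    return x * x + y * y == 1


# Parameter values t = 10, 100, ..., 100000 and the squared distance
# of the corresponding points from the exceptional point (-1, 0).
samples = [circle_point(10 ** k) for k in range(1, 6)]
distances_sq = [(x + 1) ** 2 + y ** 2 for x, y in samples]

if __name__ == "__main__":
    for point, dist in zip(samples, distances_sq):
        print(point, float(dist))
```

Every sampled point lies on the circle and none equals (-1, 0), while the squared distances, equal to $4/(1 + t^2)$, decrease towards zero. This is the behaviour the theorem asserts in general.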


We shall treat in detail only the case in which the parameters $t_1, \dots, t_m$
are all algebraically independent over K; but the method of proof is quite the
same if the representation involves additional parameters, say $u_{m+1}, \dots, u_s$,
dependent on the t's. Since each x is algebraic over any extension of
$K(t_1, \dots, t_m)$ we may write the irreducible equation for $x_{q_1}$ over $K(t_1, \dots, t_m)$,
the irreducible equation for $x_{q_2}$ over $K(t_1, \dots, t_m, x_{q_1})$, etc., each divided
through by its leading coefficient:


tq, + a1 (ti, tm) da(h,° tm) = 0 


(α)


1 Received November 12, 1936. 


the order $x_{q_1}, \dots, x_{q_n}$ to be determined later. Although the coefficients
$a_1, a_2, \dots$ are in general rational functions of both x's and t's, they may be
made polynomials in the x's; the denominators will then involve only
$t_1, \dots, t_m$. Let a non-vanishing common multiple of all denominators be
$A(t_1, \dots, t_m)$. Any set of complex numbers $t'_1, \dots, t'_m, x'_1, \dots, x'_n$
satisfying (α) together with the relation $A(t'_1, \dots, t'_m) \neq 0$ is allowable (2).
The theorem will be proved by showing that for any point $\{x'_1, \dots, x'_n\}$
of M there is a neighboring point $\{x_1, \dots, x_n\}$ of M to which we can assign
parameter values $t_1, \dots, t_m$ in such a way that the set $t_1, \dots, t_m, x_1, \dots, x_n$
satisfies (α) with $A(t_1, \dots, t_m) \neq 0$.

For this purpose we choose a new transcendental basis (3) of
$K(t_1, \dots, t_m, x_1, \dots, x_n)$. Every element of $K(t_1, \dots, t_m, x_1, \dots, x_n)$ is
algebraically dependent on the ordered set $x_1, \dots, x_n, t_1, \dots, t_m$. If from
this set we select those elements which are algebraically independent (over K)
of all preceding elements, we obtain a set Σ having the following properties:

(a) the number of elements in Σ is exactly m;

(b) every element of $K(t_1, \dots, t_m, x_1, \dots, x_n)$ is algebraically dependent on Σ;

(c) the elements of Σ are algebraically independent over K.

Let the elements of Σ be r of the x's and m - r of the t's. After suitably renumbering the x's and t's, we may assume that Σ contains $x_1, \dots, x_r, t_1, \dots, t_{m-r}$,
and that equations (α) have been calculated correspondingly.

The field $K(t_1, \dots, t_m, x_1, \dots, x_n)$ can now be regarded as an algebraic
extension of the pure transcendental field $K(x_1, \dots, x_r, t_1, \dots, t_{m-r})$. We
may therefore write the irreducible equation satisfied by $x_{r+1}$ over
$K(x_1, \dots, x_r, t_1, \dots, t_{m-r})$, the irreducible equation satisfied by $x_{r+2}$ over
$K(x_1, \dots, x_r, t_1, \dots, t_{m-r}, x_{r+1})$, etc., each divided through by its leading coefficient. Note
however that no t's will appear in the coefficients of the equations for
$x_{r+1}, \dots, x_n$, for the existence of a relation $F(x_1, \dots, x_n, t_1, \dots, t_{m-r}) = 0$
which actually involved some $t_e \in \Sigma$ would imply the dependence of that $t_e$ on
the set $x_1, \dots, x_n, t_1, \dots, t_{e-1}, t_{e+1}, \dots, t_{m-r}$ and therefore on the set
$x_1, \dots, x_r, t_1, \dots, t_{e-1}, t_{e+1}, \dots, t_{m-r}$, which is impossible. The equations
under consideration then take the form:


The denominators of the various coefficients need involve only $x_1, \dots, x_r$; let
a non-vanishing common multiple of all denominators be $B(x_1, \dots, x_r)$.




Continuing, we write the irreducible equation satisfied by $t_{m-r+1}$ over
$K(x_1, \dots, x_r, t_1, \dots, t_{m-r})$, the irreducible equation satisfied by $t_{m-r+2}$ over
$K(x_1, \dots, x_r, t_1, \dots, t_{m-r}, t_{m-r+1})$, etc., each divided through by its leading
coefficient:

(y) 

Un + h, (2%, ti, Um-rs +- ti, tm-1) =). 
Only $x_1, \dots, x_r, t_1, \dots, t_{m-r}$ need occur in the denominators of the coefficients. Let a non-vanishing common multiple of all denominators be
$C(x_1, \dots, x_r, t_1, \dots, t_{m-r})$. Any set of complex numbers satisfying (β)
and (γ) with $B \cdot C \neq 0$ will be an allowable set, and will therefore satisfy the
equations resulting from (α) on multiplication of each of the latter by
$A(t_1, \dots, t_m)$. It remains then, to ensure the non-vanishing of $A(t_1, \dots, t_m)$.
Now as an element of $K(t_1, \dots, t_m, x_1, \dots, x_n)$, A satisfies some irreducible
equation over $K(x_1, \dots, x_r, t_1, \dots, t_{m-r})$:

$A^s + p_1(x_1, \dots, x_r, t_1, \dots, t_{m-r})\, A^{s-1} + \cdots + p_s(x_1, \dots, x_r, t_1, \dots, t_{m-r}) = 0$.

Let $D(x_1, \dots, x_r, t_1, \dots, t_{m-r})$ be a non-vanishing common multiple of
all denominators and of the numerator $p_s$ of the last coefficient. Then
$D(x'_1, \dots, x'_r, t'_1, \dots, t'_{m-r}) \neq 0$ implies $A(t'_1, \dots, t'_m) \neq 0$ for any
allowable set $x'_1, \dots, x'_n, t'_1, \dots, t'_m$. Thus the non-vanishing of the polynomial $B \cdot C \cdot D = V(x_1, \dots, x_r, t_1, \dots, t_{m-r})$ implies the non-vanishing of
all denominators so far considered.

We may arrange V according to power-products of the t's, and take a
non-vanishing common multiple $P(x_1, \dots, x_r)$ of the resulting coefficients.
Now P does not vanish everywhere on the irreducible (2) manifold M, since
otherwise it would vanish identically as an element of $K(t_1, \dots, t_m, x_1, \dots, x_n)$.
Let $\{x'_1, \dots, x'_n\}$ be any point of M. If $P(x'_1, \dots, x'_r) = 0$, then by Ritt's
theorem that point is a limit point of points $\{x_1, \dots, x_n\}$ of M for which
$P \neq 0$. Suppose therefore that $P(x'_1, \dots, x'_r) \neq 0$. Now equations (β)
after multiplication by $B(x_1, \dots, x_r)$ are satisfied by all allowable values of
the x's, and thus constitute part of the equations defining the manifold M.
But at the point under consideration $B \neq 0$, so that its coordinates actually
satisfy equations (β).




The independent quantities $t'_1, \dots, t'_{m-r}$ may be chosen in such a way
that $V(x'_1, \dots, x'_r, t'_1, \dots, t'_{m-r}) \neq 0$. Using these values $x'_1, \dots, x'_r$,
$t'_1, \dots, t'_{m-r}$, we may calculate successively the remaining t's from (γ).
The set $x'_1, \dots, x'_n, t'_1, \dots, t'_m$, being allowable, will satisfy equations (α),
for by construction $A(t'_1, \dots, t'_m) \neq 0$.

Thus any point of M at which $P \neq 0$ may be obtained directly from the
original parametric representation; and any other point of M is a limit point
of points thus reachable.


PRINCETON UNIVERSITY. 


REFERENCES. 


(1) J. F. Ritt, “Differential equations,” American Mathematical Society Colloquium
Publications, vol. 14 (1932), p. 91. Also B. L. van der Waerden, “Zur
algebraischen Geometrie III,” Mathematische Annalen, vol. 108 (1933), pp.
694-698.

(2) For notation and definitions cf. B. L. van der Waerden, Moderne Algebra, vol. II, 
pp. 51-61; or B. L. van der Waerden, “ Zur Nullstellentheorie der Polynom- 
ideale,” Mathematische Annalen, vol. 96 (1926-27), pp. 183-208. 

(3) B. L. van der Waerden, Moderne Algebra, vol. I, pp. 204-206. 




A REMARK CONCERNING THE PARAMETRIC REPRESENTATION
OF AN ALGEBRAIC VARIETY.¹


By OSCAR ZARISKI.


In his paper “ On those points of an algebraic manifold not reachable 
by a given parametric representation,” published in the present issue of this 
Journal, Mr. J. F. Daly treats the case in which the number of parameters 
in a given parametric representation of an algebraic variety exceeds the di- 
mension of the variety; nor need the parameters belong to the field of algebraic 
functions defined by the variety. While this type of parametric representa- 
tions is more general than the one treated heretofore explicitly in the literature 
(see, especially, van der Waerden, “Über irreduzible algebraische Mannigfaltigkeiten,” Mathematische Annalen, vol. 108 (1933)), it may be pointed
out that the generalization given by Daly can also be obtained by using some 
simple properties of rational transformations of varieties. The following proof 
is taken from the mimeographed notes of the algebraic geometry seminar con- 
ducted by Professor Lefschetz and myself in Princeton, 1934. 


1. Let V be an irreducible algebraic r-dimensional variety in $S_m(y_1, \dots, y_m)$
and let


(1) $x_k = R_k(y_i) = P_k(y_i)/Q(y_i)$, (k = 1, 2, ..., n)
be the equations of a rational transformation of V into an algebraic variety W
in $S_n(x_1, \dots, x_n)$. We assume, of course, that $Q \neq 0$ on V. The coordinates
$\eta_1, \dots, \eta_m$ of a generic point of V are elements of a field $\Omega = K(\eta_1, \dots, \eta_m)$
of algebraic functions of r independent variables, where K is the field of complex numbers. The coordinates of a generic point of W are $\xi_k = R_k(\eta_i)$ and
define a field $\Omega' = K(\xi)$, $\Omega' \subseteq \Omega$. If $\rho$ ($\le r$) is the degree of transcendentality
of $\Omega'$, then W is of dimension $\rho$.

Let $g(y_1, \dots, y_m)$ be some polynomial which does not vanish identically
on V, and let T denote the set of points of W which can be obtained directly
from the equations (1) and which correspond to points of V at which $g \neq 0$,
i.e. points of W which correspond to points of V at which $Q \neq 0$ and $g \neq 0$.
We prove that W is the closure of T.

From the theory of fields it follows that $\Omega = \Omega'(t_1, \dots, t_s, \eta)$, $s + \rho = r$,


1 Received November 16, 1936. 


where the $t_i$'s are algebraically independent over $\Omega'$ and where $\eta$ satisfies an
algebraic equation $f(\xi, t_i, \eta) = 0$ with coefficients in K. Let

$\eta_i = S_i(\xi, t_i, \eta)/M(\xi, t_i)$,

where $S_i$ and M are polynomials, and let


$N(Q) = L(\xi, t_i)/M(\xi, t_i)$, $N(g) = G(\xi, t_i)/M(\xi, t_i)$


be the norms over $K(\xi, t_i)$ of $Q(\eta_1, \dots, \eta_m)$ and of $g(\eta_1, \dots, \eta_m)$ respectively. Here $M(\xi, t_i) \neq 0$, and also $L(\xi, t_i) \neq 0$, $G(\xi, t_i) \neq 0$, since
$Q(\eta_i) \neq 0$ and $g(\eta_i) \neq 0$, by hypothesis. Let then $x^0$ be a point of W at
which the polynomials in t: $M(x^0, t_i)$, $L(x^0, t_i)$, $G(x^0, t_i)$ do not vanish
identically. If $(t^0_1, \dots, t^0_s)$ is any set of values of the t's at which these
polynomials do not vanish, and if $\eta^0$ is a root of $f(x^0, t^0_i, \eta) = 0$, then
$y^0_i = S_i(x^0, t^0_i, \eta^0)/M(x^0, t^0_i)$ are the coordinates of a point of V at which
$Q \neq 0$, $g \neq 0$ and moreover $x^0_k = R_k(y^0_i)$. That is, any point $(x^0)$ of W
at which none of the polynomials $M(x^0, t)$, $L(x^0, t)$, $G(x^0, t)$ vanishes identically
belongs to the set T. It follows then by a theorem of Ritt that every point
of W is a limit point of points in T, q.e.d.


2. Let the coordinates $x_1, \dots, x_n$ of a generic point of an algebraic
$\rho$-dimensional variety W be algebraic functions of r parameters $t_1, \dots, t_r$,
independent over K. The variety W is a rational transform of the r-dimensional
variety whose generic point is $(x_1, x_2, \dots, x_n, t_1, \dots, t_r)$. We identify this
variety with the variety V of the preceding section. For the variety V the
parameters $t_i$ are merely some of the coordinates, and the points of V which
cannot be reached by this parametric representation satisfy a certain equation
g = 0, where g = g(x, t) is a polynomial not identically zero on V. By the
preceding section it follows that the points of W which cannot be reached by
the given parametric representation are limit points of reachable points.


THE JOHNS HOPKINS UNIVERSITY. 




ON THE CONSTRUCTION OF SYMMETRIC RULED SURFACES. 


By ARNOLD EMCH.


1. Introduction. Let $S_3(x)$ denote a projective space of three dimensions
with the homogeneous variables $x_1, x_2, x_3, x_4$ and $\phi_1, \phi_2, \phi_3, \phi_4$ the four elementary symmetric functions on these variables, then


(1) = 0, 


with the $n_i$ as positive integers, and $n_1 + 2n_2 + 3n_3 + 4n_4 = n$ satisfied
represents, by definition, a symmetric surface. Noted examples are the Cayley
cubic $\sum 1/x_i = 0$ and Clebsch's diagonal surface. The writer has investigated
surfaces of this kind in a number of papers. Two symmetric surfaces intersect
obviously in a symmetric space curve, which is left invariant by the symmetric
group $G_{24}$ of collineations, since the surfaces producing it are invariant. In
this connection I mention the discovery of a remarkable sextic of genus four
which lies on 10 cubic cones.²

It is naturally interesting to enquire about the possibility of symmetric
ruled surfaces. It is clear that such surfaces exist, because $\phi_1^2 + \lambda\phi_2 = 0$
represents a pencil of symmetric quadrics (admitting imaginary rulings).
Then, of course, there is the special class of symmetric cones. If we admit
the existence of general symmetric ruled surfaces, then by $G_{24}$ a generic
generatrix determines immediately 23 more on the surface. The first question
then is what is the lowest order of surface on 24 lines belonging to the group
$G_{24}$. Choose a generic line l, determined parametrically by


(2) $\rho x_i = a_i + \lambda b_i$ (i = 1, 2, 3, 4),


with (a) and (b) as two arbitrary distinct points. Substituting (2) in (1),
l will lie on (1) when the result is identically equal to zero for all values of $\lambda$.
This leads to an equation of degree n in $\lambda$, which is satisfied for all values
of $\lambda$ when its n + 1 coefficients vanish. Hence (1) must have a number of
(effective) coefficients ≥ n + 1. The experimental
solution of the Diophantine equation gives for the orders n = 1, 2, 3, 4, 5, 6, 7, ...
the number of effective constants of (1): 0, 1, 2, 3, 4, 7, 9, .... This shows

1 Received October 21, 1936. 
² “Über eine besondere Raumkurve sechster Ordnung,” Monatshefte für Mathematik
und Physik, vol. 40 (1933), pp. 193-200.


that the first case in which the inequality is satisfied is for n 6. There is 
still one constant available. Hence 


THEOREM 1. On a generic line $l$ in $S_3$ there is a pencil of symmetric
sextic surfaces. There are no such surfaces of lower order on $l$.


For general ruled surfaces one must look for higher than the 6th order. 
To solve this problem we use the parametric representation. 


2. Construction of ruled symmetric surfaces. Consider the equations 


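(3)  $\rho x_1 = \phi^*_1(\lambda_2, \lambda_3, \lambda_4) + \mu\,\psi^*_1(\lambda_2, \lambda_3, \lambda_4),$
     $\rho x_2 = \phi^*_2(\lambda_1, \lambda_3, \lambda_4) + \mu\,\psi^*_2(\lambda_1, \lambda_3, \lambda_4),$
     $\rho x_3 = \phi^*_3(\lambda_1, \lambda_2, \lambda_4) + \mu\,\psi^*_3(\lambda_1, \lambda_2, \lambda_4),$
     $\rho x_4 = \phi^*_4(\lambda_1, \lambda_2, \lambda_3) + \mu\,\psi^*_4(\lambda_1, \lambda_2, \lambda_3),$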


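in which $\phi^*(\alpha, \beta, \gamma)$ and $\psi^*(\alpha, \beta, \gamma)$ are symmetric polynomials in $\alpha, \beta, \gamma$,
from which the $\phi^*_i$ and $\psi^*_i$ are obtained by replacing $\alpha, \beta, \gamma$ by $\lambda_1, \lambda_2, \lambda_3, \lambda_4$
as indicated. Then the $\phi^*_i$ and $\psi^*_i$, except as to a possible change of signs
throughout, permute in the same way as any chosen permutations of the $\lambda$’s.
Hence a permutation

$\begin{pmatrix} \lambda_1 & \lambda_2 & \lambda_3 & \lambda_4 \\ \lambda_i & \lambda_k & \lambda_l & \lambda_m \end{pmatrix}$

induces the permutation

$\begin{pmatrix} x_1 & x_2 & x_3 & x_4 \\ \rho x_i & \rho x_k & \rho x_l & \rho x_m \end{pmatrix}$

in which $\rho = \pm 1$. For a definite set of values for the $\lambda$’s the $\phi^*$’s and $\psi^*$’s
in (3) represent two points in $S_3$, and with $\mu$ variable (3) represents a straight
line $l$. Thus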


THEOREM 2. On the application of $G_{24}$ to a definite set $\lambda_1, \lambda_2, \lambda_3, \lambda_4$ and
with $\mu$ variable (3) represents 24 lines $l_i$. Each of these is met by six others
of the same set.


To prove the second part of the theorem, notice that a line $l_j$ cuts the six
planes $x_i - x_k = 0$ in six points $P_{ik}$. By the substitution $(ik)$ this $P_{ik}$ is not
changed, but $l_j$ is transformed into that line of the set of 24 which corresponds to $(ik)$.
To (3) we now adjoin


(4)  $M(\lambda_1, \lambda_2, \lambda_3, \lambda_4) = 0,$
     $N(\lambda_1, \lambda_2, \lambda_3, \lambda_4) = 0,$

two equations of degree $m$ and $n$ and symmetric in the $\lambda$’s, and also homogeneous
in order to make them of dimension 3. Then in an $S_3(\lambda)$-space (4)




represents two surfaces which we assume to intersect in a complete irreducible
curve $C_{mn}$. To every point $(\lambda)$ of this curve correspond by the $G_{24}$ 24 points
(including the point itself) of the same curve and by (3) 24 lines $l_i$. As $(\lambda)$
describes $C_{mn}$, the $l_i$’s generate a ruled surface $R$ whose genus is the same as
that of $C_{mn}$, because there exists a (1,1) correspondence between the points
of $C_{mn}$ and the generatrices of the ruled surface in $S_3(x)$. To determine the
order of $R$, the $\lambda$’s and $\mu$ must be eliminated from (3) and (4). Denote
$\phi^*_i(\lambda_j, \lambda_k, \lambda_l)$ by $\phi^*_i$, then elimination of $\mu$ from (3) gives


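(5)  (a)  $x_1(\phi^*_2\psi^*_3 - \phi^*_3\psi^*_2) + x_2(\phi^*_3\psi^*_1 - \phi^*_1\psi^*_3) + x_3(\phi^*_1\psi^*_2 - \phi^*_2\psi^*_1) = 0,$
     (b)  $x_2(\phi^*_3\psi^*_4 - \phi^*_4\psi^*_3) + x_3(\phi^*_4\psi^*_2 - \phi^*_2\psi^*_4) + x_4(\phi^*_2\psi^*_3 - \phi^*_3\psi^*_2) = 0.$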
These are equations of degree $r + s$ in the $\lambda$’s. Elimination of the $\lambda$’s gives
a symmetric equation of degree $mn(r + s)$ in $x_1, x_2, x_3$ and of degree³
$mn(r + s)$ in $x_2, x_3, x_4$, hence of degree $mn(r + s)$ in $x_1, x_2, x_3, x_4$. If however
equations (4) and (5) have $k$ points in common which are in the same order
$\alpha, \beta, \gamma, \delta$-fold for the four equations, then the resultant is of degree
$n(r + s)^2 - k\beta\gamma\delta$ in the coefficients of $M$; $m(r + s)^2 - k\alpha\gamma\delta$ in the coefficients
of $N$; $mn(r + s) - k\alpha\beta\delta$ in the coefficients of (5a); $mn(r + s) - k\alpha\beta\gamma$ in
the coefficients of (5b). When $\delta = \gamma$, then the resultant (reduced) is of
degree $mn(r + s) - k\alpha\beta\gamma$. Hence


THEOREM 3. The parametric equations (3) in conjunction with (4) and
the indicated multiplicities represent a ruled symmetric surface $R$ of order
$mn(r + s) - k\alpha\beta\gamma$.

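3. Example of octic ruled surface. If (3) has the form

(6)  $\rho x_1 = \lambda_2\lambda_3\lambda_4 + \mu\lambda_1,$
     $\rho x_2 = \lambda_1\lambda_3\lambda_4 + \mu\lambda_2,$
     $\rho x_3 = \lambda_1\lambda_2\lambda_4 + \mu\lambda_3,$
     $\rho x_4 = \lambda_1\lambda_2\lambda_3 + \mu\lambda_4,$
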
and 
N 
$k = 4$ (vertices of the coordinate tetrahedron), $\beta = 2$, then
³ The resultant of equations (4) and (5) according to the conventional methods
is of degree $2mn(r + s)$, but reduces to the degree $mn(r + s)$ after deleting extraneous
factors. This is verified by means of the (1,1) correspondence which exists between
the curves $C_{mnr}$ and $C_{mns}$ of indicated orders, obtained in a $\lambda$-space by the mapping of
$(M\,N)$ by means of the $\phi^*$’s and $\psi^*$’s.




the order of $R$ becomes $3\cdot2\cdot4 - 4\cdot2\cdot2 = 8$. Ordinarily the elimination
of the $\lambda$’s and $\mu$ presents great difficulties on account of the labor involved.
In this example this task may actually be accomplished as follows: Let
$\phi_1, \phi_2, \phi_3, \phi_4$ now stand for the elementary symmetric functions of the variables
$x_i$, and $L_1, L_2, L_3, L_4$ for the $\lambda$’s. Then


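(8)  $\rho\phi_1 = L_3 + \mu L_1,$
     $\rho^2\phi_2 = L_2L_4 + \mu(L_1L_3 - 4L_4) + \mu^2L_2,$
     $\rho^3\phi_3 = L_1L_4^2 + \mu L_4(L_1L_2 - 3L_3) + \mu^2(L_2L_3 - 3L_1L_4) + \mu^3L_3,$
     $\rho^4\phi_4 = L_4^3 + \mu L_4^2(L_1^2 - 2L_2) + \mu^2L_4(L_2^2 - 2L_1L_3 + 2L_4) + \mu^3(L_3^2 - 2L_2L_4) + \mu^4L_4.$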


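When $(\lambda)$ lies on the intersection $C_6$ (rational with double points at the $A_i$)
of $L_2 = 0$, $L_3 = 0$, (8) reduces to

(9)  $\rho\phi_1 = \mu L_1,$
     $\rho^2\phi_2 = -4\mu L_4,$
     $\rho^3\phi_3 = L_1L_4^2 - 3\mu^2L_1L_4,$
     $\rho^4\phi_4 = L_4^3 + \mu L_1^2L_4^2 + 2\mu^2L_4^2 + \mu^4L_4.$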


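Eliminating $\rho$, $\mu$, $L_1$, $L_4$ from (9) we obtain the octic ruled surface $R$:

(10)  $-9\phi_1^4\phi_2^2 - 12\phi_1^2\phi_2\phi_4 + 4\phi_1^2\phi_2^3 - 16\phi_1^2\phi_3^2 + 24\phi_1^3\phi_2\phi_3$
      $+ 4\phi_2\phi_3^2 - 8\phi_1\phi_2^2\phi_3 + 16\phi_1\phi_3\phi_4 = 0.$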
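
The symmetric-function identities and the octic of this example can be spot-checked with exact integer arithmetic. The sketch below is only an illustration: it assumes the generator $\rho x_i = \lambda_j\lambda_k\lambda_l + \mu\lambda_i$, takes $\rho = 1$, and assumes that the curve $C_6$ is cut out by $L_2 = 0$, $L_3 = 0$; the function names are hypothetical and are not part of the paper.

```python
from itertools import combinations
from math import prod


def elementary(values):
    """Elementary symmetric functions e1..e4 of four numbers."""
    return [sum(prod(c) for c in combinations(values, k)) for k in range(1, 5)]


def generator_point(lam, mu):
    """x_i = (product of the other three lambdas) + mu * lambda_i (rho = 1)."""
    return [prod(lam[:i] + lam[i + 1:]) + mu * lam[i] for i in range(4)]


def phi_from_L(L, mu):
    """phi_1..phi_4 expressed through L_1..L_4 and mu (rho = 1)."""
    L1, L2, L3, L4 = L
    phi1 = L3 + mu * L1
    phi2 = L2 * L4 + mu * (L1 * L3 - 4 * L4) + mu**2 * L2
    phi3 = (L1 * L4**2 + mu * L4 * (L1 * L2 - 3 * L3)
            + mu**2 * (L2 * L3 - 3 * L1 * L4) + mu**3 * L3)
    phi4 = (L4**3 + mu * L4**2 * (L1**2 - 2 * L2)
            + mu**2 * L4 * (L2**2 - 2 * L1 * L3 + 2 * L4)
            + mu**3 * (L3**2 - 2 * L2 * L4) + mu**4 * L4)
    return [phi1, phi2, phi3, phi4]


def octic(phi):
    """Left-hand side of the octic in phi_1..phi_4 (weight 8 in the x's)."""
    p1, p2, p3, p4 = phi
    return (-9 * p1**4 * p2**2 - 12 * p1**2 * p2 * p4 + 4 * p1**2 * p2**3
            - 16 * p1**2 * p3**2 + 24 * p1**3 * p2 * p3
            + 4 * p2 * p3**2 - 8 * p1 * p2**2 * p3 + 16 * p1 * p3 * p4)


if __name__ == "__main__":
    lam, mu = [1, 2, 3, 5], 7
    # The phi's of the point agree with the closed expressions in the L's.
    print(elementary(generator_point(lam, mu)) == phi_from_L(elementary(lam), mu))
    # With L2 = L3 = 0 the octic vanishes for arbitrary L1, L4, mu.
    print(octic(phi_from_L([3, 0, 0, -2], 5)))
```

Because the octic is isobaric of weight 8 in the $x$’s, normalising $\rho = 1$ loses nothing; the check treats $L_1$, $L_4$, $\mu$ as free parameters, which is all that the elimination uses.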


It cuts the unit plane $\phi_1 = 0$ in the conic ($\phi_1 = 0$, $\phi_2 = 0$) $= C_2$ and contains
the three diagonal lines ($\phi_1 = 0$, $\phi_3 = 0$) as double lines. If $P(\lambda)$ lies on $C_6$,
then by the cubic involution $\rho\lambda'_i = 1/\lambda_i$, $P(\lambda)$ goes into $P'(1/\lambda_1, 1/\lambda_2, 1/\lambda_3, 1/\lambda_4)$
on $C_2$. The join of $P'P$


which lies in the intersection of $L_2$ and $L_3$, i.e.,


interpreted in $S_3$ is a generatrix of $R$. Thus


THEOREM 4. The locus of joins of corresponding points in the cubic
involution $T$ in which $C_6$ and $C_2$ are corresponding is an octic symmetric ruled
surface $R$.


This can be verified by the principle of correspondence: Let $g$ be a generic
line in $S_3$ and $(\alpha)$ the pencil of planes on $g$. An $\alpha$ cuts $C_6$ in six points $B_i$
to which correspond by $T$ six points $B'_i$ on $C_2$, which joined to $g$ give six planes
$\alpha'$. Conversely to a plane $\alpha'$ which cuts $C_2$ in two points $B'_i$ correspond by $T$
two points $B_i$ on $C_6$, thus determining two planes $\alpha$. This establishes a (6, 2)-
correspondence between the planes $\alpha$ and $\alpha'$ with $6 + 2 = 8$ coincidences.
Let $\alpha^*$ be such a coincidence, then there lie on this plane two corresponding




points by $T$, $P$ on $C_6$ and $P'$ on $C_2$, such that $P'P$ is a generatrix of $R$. There
are thus 8 such generatrices cutting $g$. $R$ is of order 8.


4. Developable ruled surfaces. Every developable surface which is not
a cone has an “edge of regression” or cuspidal curve which may be any
space curve. When the developable surface is symmetric, then a tangent of
the cuspidal curve is transformed into 24 other such tangents by the $G_{24}$.
From this follows that the cuspidal curve is symmetric and that it may therefore
be obtained as the intersection of two symmetric surfaces $f_m = 0$ and
$g_n = 0$ of orders $m$ and $n$ respectively. I shall restrict myself to the case in
which the intersection of $f_m$ and $g_n$ is a complete irreducible curve $C_{mn}$. The
tangent planes at a point of intersection $(y)$ of the two surfaces, to each
$f_m$ and $g_n$ are

(11)  $\sum_{i=1}^{4} \dfrac{\partial f}{\partial y_i}\,x_i = 0,$ \qquad (12)  $\sum_{i=1}^{4} \dfrac{\partial g}{\partial y_i}\,x_i = 0.$

They intersect in a tangent $t$ of the curve of intersection of the two surfaces and
as $(y)$ describes the curve of intersection $C_{mn}$, $t$ describes the developable
surface $D$ which is now symmetric. Its order is obtained by eliminating
$(y)$ from $f_m = 0$, $g_n = 0$, (11) and (12), which gives for the order of $D$
$mn(n - 1) + mn(m - 1)$ or


(13)  $d = mn(m + n - 2).$


The order of the double curve of $D$, since $C_{mn}$ has supposedly no effective
singularities, is

(14)  $\delta = \tfrac12\,mn(m + n - 2)[mn(m + n - 2) - 4],$

according to a well known formula (Cayley). $D$ has a double curve of order
$\tfrac12\,mn(m + n - 2)$ (which is always possible, because $d$ is even) in each of the
six planes $x_i - x_k = 0$. Hence, outside of these components, the double curve
of $D$ contains a residual double curve of order


(15)  $\sigma = \tfrac12\,mn(m + n - 2)[mn(m + n - 2) - 10].$
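
The three orders are tied together by simple arithmetic: the residual order is what remains of the double curve after the six plane components, each of order $d/2$, are removed. A minimal sketch, assuming the readings $\delta = \tfrac12 d(d-4)$ and $\sigma = \tfrac12 d(d-10)$ with $d = mn(m+n-2)$; the function name is hypothetical.

```python
def developable_orders(m, n):
    """Return (d, delta, sigma) for the tangent developable of C_mn.

    d     : order of the developable, d = m*n*(m + n - 2)
    delta : order of its double curve, d*(d - 4)/2
    sigma : order of the residual double curve, d*(d - 10)/2
    """
    d = m * n * (m + n - 2)          # always even
    delta = d * (d - 4) // 2
    sigma = d * (d - 10) // 2
    return d, delta, sigma


if __name__ == "__main__":
    d, delta, sigma = developable_orders(4, 2)
    # Six planes x_i - x_k = 0, each carrying a double curve of order d/2.
    print(d, delta, sigma, delta - 6 * (d // 2) == sigma)
```

The identity $\tfrac12 d(d-4) - 3d = \tfrac12 d(d-10)$ holds for every $d$, so the check succeeds for any pair of orders.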


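To find the point of intersection of $t$ with the unit plane, we solve (11), (12)
and $\sum x_i = 0$ for $(x)$. Denoting $\partial f/\partial y_i$ and $\partial g/\partial y_i$ by $f_i$ and $g_i$ respectively

(16)  $\rho x_1 = f_3g_4 - f_4g_3 + f_4g_2 - f_2g_4 + f_2g_3 - f_3g_2,$
      $\rho x_2 = -(f_3g_4 - f_4g_3 + f_4g_1 - f_1g_4 + f_1g_3 - f_3g_1),$
      $\rho x_3 = f_2g_4 - f_4g_2 + f_4g_1 - f_1g_4 + f_1g_2 - f_2g_1,$
      $\rho x_4 = -(f_2g_3 - f_3g_2 + f_3g_1 - f_1g_3 + f_1g_2 - f_2g_1).$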


As the point $P(\lambda)$ describes $C_{mn}$, the point $(x)$ in (16) describes the curve
of intersection $D'$ of $D$ with the unit plane. The join of $D'$ with $P$ is a tangent
of $C_{mn}$ or a generatrix of $D$. With $\mu$ variable we have


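THEOREM 5. The parametric representation of the developable tangent
surface of $C_{mn}$, the curve of intersection of $f_m$ and $g_n$, is given by

(17)  $\rho x_1 = f_3g_4 - f_4g_3 + f_4g_2 - f_2g_4 + f_2g_3 - f_3g_2 + \mu\lambda_1,$
      $\rho x_2 = -(f_3g_4 - f_4g_3 + f_4g_1 - f_1g_4 + f_1g_3 - f_3g_1) + \mu\lambda_2,$
      $\rho x_3 = f_2g_4 - f_4g_2 + f_4g_1 - f_1g_4 + f_1g_2 - f_2g_1 + \mu\lambda_3,$
      $\rho x_4 = -(f_2g_3 - f_3g_2 + f_3g_1 - f_1g_3 + f_1g_2 - f_2g_1) + \mu\lambda_4;$
      $f_m = 0, \quad g_n = 0.$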


As may be expected these are precisely of the type represented by (3) and (4). 

Example. The simplest developable symmetric surface is obtained as the 
tangent surface of the sextic $C_6$ of genus four obtained as the intersection of
the symmetric cubic and quadric 


$M = 0$ and $N = \sum \lambda_i\lambda_k = 0$ $(i, k = 1, 2, 3, 4)$.
In this case (17) becomes 


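(18)  $\rho x_1 = -(\lambda_2 - \lambda_3)(\lambda_3 - \lambda_4)(\lambda_4 - \lambda_2) + \mu\lambda_1,$
      $\rho x_2 = (\lambda_1 - \lambda_3)(\lambda_3 - \lambda_4)(\lambda_4 - \lambda_1) + \mu\lambda_2,$
      $\rho x_3 = -(\lambda_1 - \lambda_2)(\lambda_2 - \lambda_4)(\lambda_4 - \lambda_1) + \mu\lambda_3,$
      $\rho x_4 = (\lambda_1 - \lambda_2)(\lambda_2 - \lambda_3)(\lambda_3 - \lambda_1) + \mu\lambda_4,$


a symmetric developable surface of order 12, whose equation in the $x$’s is
obtained by elimination of $\rho$, $\mu$, and the $\lambda$’s from (18) and $M = 0$, $N = 0$.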
This is omitted on account of the enormous amount of labor which is required. 


UNIVERSITY OF ILLINOIS. 




ON CIRCLES CONNECTED WITH THREE AND FOUR LINES.* 


By J. R. MUSSELMAN.


I. Let us consider six points $a_i$ $(i = 1, 2, \ldots, 6)$ of a plane (or sphere)
which are ordered. A quadratic covariant associated with them has been discussed
by the Morleys.¹ Here we study the six points under the condition that
the cross-ratio

(1.1)  $\dfrac{(a_1 - a_2)(a_3 - a_4)(a_5 - a_6)}{(a_2 - a_3)(a_4 - a_5)(a_6 - a_1)} = \rho,$


where $\rho$ is any real number. Since this cross-ratio is invariant under
homographies

$y = (\alpha x + \beta)/(\gamma x + \delta)$

we are asking that it be invariant also under antigraphies

$\bar{y} = (\alpha x + \beta)/(\gamma x + \delta).$

The significance of (1.1) is easily seen. For the equation

$\dfrac{(a_1 - a_2)(a_3 - x)}{(a_2 - a_3)(x - a_1)} = \rho_1$


represents a circle on the points $a_1$, $a_2$ and $a_3$. Writing similar expressions for
the circles $a_3a_4a_5$ and $a_5a_6a_1$, and multiplying the three equations together we
see that (1.1) is precisely the condition that circles $a_1a_2a_3$, $a_3a_4a_5$, $a_5a_6a_1$ have
a common point. Furthermore, the condition that circles $a_2a_3a_4$, $a_4a_5a_6$, $a_6a_1a_2$
have a common point is

(1.2)  $\dfrac{(a_2 - a_3)(a_4 - a_5)(a_6 - a_1)}{(a_3 - a_4)(a_5 - a_6)(a_1 - a_2)} = \rho', \qquad \rho' \text{ real}.$

Now the truth of either one of (1.1) or (1.2) implies the truth of the other,
hence we have here a simple proof of the theorem: if the circles $a_1a_2a_3$, $a_3a_4a_5$,
$a_5a_6a_1$ meet at a point, say $m$, then the circles $a_2a_3a_4$, $a_4a_5a_6$, $a_6a_1a_2$ meet at a
point, say $n$.

If we choose the points $a_i$ so that $m$ is $\infty$, then $a_2$, $a_4$, $a_6$ lie respectively
on the lines $a_1a_3$, $a_3a_5$ and $a_5a_1$. This gives us then the theorem of Miquel:


* Read before the International Congress of Mathematicians, Oslo, July 17, 1936. 
Received by the Editors, November 30, 1936; revised February 4, 1937. 
¹ Inversive Geometry, p. 60. See also exercise 6, page 30.




if a point be marked on each side of a triangle, and through each vertex of the
triangle and the marked points on the adjacent sides a circle be drawn, the
three circles meet at a point. We notice also that the homography which sends
$a_1$ into $a_4$, $a_3$ into $a_6$ and $a_5$ into $a_2$ sends the line $a_1a_3\infty$ into the circle
so that in general $a_1a_4$, $a_3a_6$, $a_5a_2$, and $mn$ are pairs in a homography.


II. In this section we connect the above with the following generalization
of an old theorem² that if a point $p$ has images $a_2$, $a_4$, $a_6$ in the sides of a
triangle $a_1a_3a_5$ then the circles $a_2a_3a_4$, $a_4a_5a_6$, $a_6a_1a_2$ meet at a point, say $n$,
on the circle $a_1a_3a_5$: and equally the circles $a_1a_2a_3$, $a_3a_4a_5$, $a_5a_6a_1$ meet at a
point, say $m$, on the circle $a_2a_4a_6$. We give an analytic proof of the first part
which enables us to construct the points $m$ and $n$ without the aid of the circles.
Let the coordinates of the vertices of the triangle $a_1a_3a_5$ be turns $t_i$, i.e.
$|t_i| = 1$. The elementary symmetric functions of the $t_i$ will be denoted by $\sigma_i$.
The reflection $a_2$ of the point $p$ in the side $a_1a_3$ will have as coordinate
$t_1 + t_3 - t_1t_3\bar{p}$. Consider the equation


(2.1) (4, + —o3p + (t,t; — o3p)T. 
For T = —1, respectively, this circle (2.1) passes through the 
points ds, a4, ds. Also for the turn value 
(o2 —osp) (ts —p) 
(o, — p) (tits —osp) o1— p 


a point on the circle $a_1a_3a_5$. Since the coordinate of this point, $n$, is symmetric
in $t_1$, $t_3$, $t_5$, the circles $a_6a_1a_2$ and $a_2a_3a_4$ likewise are on $n$, which proves the
first part of the theorem. The second part follows immediately from the
homography which connects the pairs $a_1a_4$, $a_3a_6$, $a_5a_2$, $m$ and $n$, for it is of
period two.
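
The first part of the theorem lends itself to a numerical check with complex coordinates. The sketch below is an illustration under stated assumptions: the circumcircle of $a_1a_3a_5$ is the unit circle, the reflection of $p$ in the chord $t_at_b$ is $t_a + t_b - t_at_b\bar{p}$, and the common point is taken to be $n = (\sigma_2 - \sigma_3\bar{p})/(\sigma_1 - p)$, the expression that appears later in this section; the function names are hypothetical.

```python
import cmath


def reflect(p, ta, tb):
    """Reflection of p in the chord joining the turns ta, tb of the unit circle."""
    return ta + tb - ta * tb * p.conjugate()


def point_n(p, t1, t3, t5):
    """Candidate common point n = (s2 - s3*conj(p)) / (s1 - p)."""
    s1 = t1 + t3 + t5
    s2 = t1 * t3 + t3 * t5 + t5 * t1
    s3 = t1 * t3 * t5
    return (s2 - s3 * p.conjugate()) / (s1 - p)


def cross_ratio(z1, z2, z3, z4):
    return ((z1 - z3) * (z2 - z4)) / ((z1 - z4) * (z2 - z3))


def concyclic(z1, z2, z3, z4, tol=1e-9):
    """Four distinct points lie on a circle (or line) iff their cross-ratio is real."""
    return abs(cross_ratio(z1, z2, z3, z4).imag) < tol


if __name__ == "__main__":
    t1, t3, t5 = (cmath.exp(1j * a) for a in (0.3, 1.9, 4.0))
    p = 0.2 + 0.1j
    a2, a4, a6 = reflect(p, t1, t3), reflect(p, t3, t5), reflect(p, t5, t1)
    n = point_n(p, t1, t3, t5)
    print(abs(n))                      # n should lie on the unit circle
    print(concyclic(n, a2, t3, a4),    # circle a2 a3 a4
          concyclic(n, a4, t5, a6),    # circle a4 a5 a6
          concyclic(n, a6, t1, a2))    # circle a6 a1 a2
```

Floating-point arithmetic is used, so the concyclicity test is a tolerance check on the imaginary part of a cross-ratio rather than an exact identity.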

For if $p$ and $a_2$ are images in the side $a_1a_3$ then $(a_1 - a_2)/(a_2 - a_3)$ and
$(a_1 - p)/(p - a_3)$ must be conjugates. Writing similar expressions for $p$ and
$a_4$, $p$ and $a_6$, and multiplying the three we obtain

$\dfrac{(a_1 - a_2)(a_3 - a_4)(a_5 - a_6)}{(a_2 - a_3)(a_4 - a_5)(a_6 - a_1)} = -1.$

But this is the condition that $a_1a_4$, $a_2a_5$, $a_3a_6$ be pairs in an involution.
Naturally $m$ and $n$ belong to this involution. When $m$ is $\infty$, we have the
theorem of Menelaus.


² This is a generalization of a theorem of Canon, Nouvelles Annales de Mathématique,
Fourth Series, vol. 8 (1908), p. 480, Problem 2108. See also R. Bouvaist,
ibid., vol. 10 (1910), p. 136.




Now if $H$ be the orthocenter of $a_1a_3a_5$, any point on the line $Hp$ will
have as coordinate $r = (\sigma_1 + \lambda p)/(1 + \lambda)$. The coordinate of $n$ will be
$(\sigma_2 - \sigma_3\bar{r})/(\sigma_1 - r)$. Substituting the values for $r$ and $\bar{r}$ in this expression
for $n$ we obtain $(\sigma_2 - \sigma_3\bar{p})/(\sigma_1 - p)$. Since this is independent of the
parameter $\lambda$, we see that the point $n$ is a fixed point on $a_1a_3a_5$ for all points
on the line $Hp$. The Simson line of $n$ is parallel to $Hp$, so $n$ can be constructed³
without the use of circles. The coordinate of the point $m$ can be
written as

1— pn’ 


(2. 2) m = 


The line $pn$ cuts the circle $a_1a_3a_5$ at a point $L$, whose coordinate is


(2. 3) 
Hence if $O$ be the center of the circle $a_1a_3a_5$, the point $n$ can be constructed
from the vector relation


III. Casey⁴ defines as “twin points” any two points such that the angles
subtended by the sides of a triangle at these points are either equal or supplementary.
With reference to the triangle $a_1a_3a_5$ the points $p$ and $m$ are twin
points. It is also known that any two points at the ends of a diameter of a
rectangular hyperbola are twin points with reference to any triangle inscribed
in the hyperbola. The theorem in Section II connecting $p$ and $m$ enables us
to state two interesting theorems about the rectangular hyperbola. (1) If we
reflect any point $p$ of a rectangular hyperbola in a chord $a_1a_3$, obtaining the
point $a_2$, then the circle $a_1a_2a_3$ will intersect the hyperbola at $m$, the diametrically
opposite point of $p$. (2) Let $A_1$, $A_2$, $A_3$, $P$ and $Q$ be any five points of a
rectangular hyperbola with $P$ and $Q$ at the ends of a diameter, and let $P_i$, $Q_i$
$(i = 1, 2, 3)$ be the reflections of $P$ and $Q$ respectively in the chords $A_2A_3$,
$A_3A_1$, $A_1A_2$; then the circles $A_2A_3P_1$, $A_3A_1P_2$, $A_1A_2P_3$ and $P_1P_2P_3$ will meet
at $Q$ and the circles $A_2A_3Q_1$, $A_3A_1Q_2$, $A_1A_2Q_3$ and $Q_1Q_2Q_3$ will meet at $P$.

One application to the geometry of the triangle will indicate the importance
of this theorem. The isogonic centers or Fermat points $F$, $F'$ of a
triangle $A_1A_2A_3$ lie on the ends of a diameter of the Kiepert hyperbola which
also passes through the vertices of the triangle. Hence if we reflect either
Fermat point in the sides $A_2A_3$, $A_3A_1$, $A_1A_2$, obtaining the points $B_1$, $B_2$ and $B_3$,
the circles $A_2A_3B_1$, $A_3A_1B_2$, $A_1A_2B_3$, $B_1B_2B_3$ are on the other Fermat point.


³ See R. A. Johnson, Modern Geometry, page 208, Theorem 329.
⁴ A Sequel to Euclid, Sixth Edition, page 249.




IV. We consider here the reflections of a point p in four lines. Our 
coordinate system is chosen so that the parabola which touches the four lines 
has the form 

2 


If $p$ be the focus of the parabola, and we reflect it in the triangle formed
by the tangents at the points $t_1$, $t_2$ and $t_3$, the coordinate of the point $n$ will be


2 
1—8,+ t,’ 


i= 


where $S_1$ is the symmetric function of four points $t_i$. For each triangle formed
by three out of the four lines we shall have a point $n$, and these four points $n$
lie on the circle $C_4$, whose equation is


2 


The inverse of the focus of the parabola as to this circle is the point 


For five lines of a parabola, we can construct five circles as C4, and the inverse 
point of the focus in each circle will lie on a circle, namely on 


2 


where now g;, refers to five points t;. The chain runs on indefinitely for lines 
of a parabola. 

When $p$ is a point on the circumcircle of the triangle, we see from (2.3)
and (2.4) that $L$ coincides with $p$ and that $m$ is at the orthocenter. Hence
if we reflect the focus of a parabola touching any four lines in the four lines,
the four points $m$, one for each three of the four lines, lie on a line, the
directrix of the parabola.

For p any point in the plane, the coordinate of n is 


_ *(p+p—?) = 
"Tip + 2(o,—1)’ 
The numerator of this fraction equated to zero is the equation of the directrix;
the denominator equated to zero gives the orthocenter of the triangle of tangents
at the points $t_1$, $t_2$, $t_3$. We thus have the theorem: if we reflect any point




on the directrix of a parabola in the triangle formed by three of its tangents, 
the point n coincides with the focus. Naturally n is indeterminate when p is 
the orthocenter of the triangle of tangents. 


V. If we choose as the point $p$ the circumcenter of a triangle $A_1A_2A_3$,
then from (2.2) the coordinate of $m$ can be written as

$m_4 = \sigma_1 - \sigma_2/\sigma_1$

where $\sigma_i$ are symmetric functions of the three symbols $t_i$. If we take a fourth
point $A_4$ on the circle $A_1A_2A_3$ we can form four triangles from the four points,
each determining a point $m_i$, which four points $m_i$ lie on a circle. For

$m_4 = \sigma_1 - \sigma_2/\sigma_1 = \dfrac{S_1^2 - S_2 - S_1t_4}{S_1 - t_4}$

where $S_i$ are symmetric functions of the four $t_i$. This, for variable $t_4$, is the
equation of a circle; hence the four points $m_i$ lie on the circle $C_4$, whose equation
may be written as

$x = \dfrac{a_2 - a_1t}{a_1 - t}$

where we write $a_2$ for $S_1^2 - S_2$ and $a_1$ for $S_1$. Now when $t = 0$, $x = a_2/a_1$;
when $t = \infty$, $x = a_1$. Therefore, the points $a_2/a_1$ and $a_1$ are inverse points
as to this circle. Four other pairs of points in the involution set up by this
circle are $A_4m_4$, $A_3m_3$, $A_2m_2$, $A_1m_1$. Hence the five pairs of points $a_2/a_1$ and
$a_1$, $A_i$ and $m_i$ are pairs of an involution. Since


— Aylts 


ay a,—t, 


where $a_i$ are the functions $S_1$ and $S_1^2 - S_2$ written for five symbols $t_i$, we have
shown that if we take five points on a circle, we can determine for each four
of the five points a circle such as $C_4$ and a point such as $a_1$, and the inverse
point of each $a_1$ as to its associated circle gives five points which lie on a circle.
This chain can go on indefinitely.


WESTERN RESERVE UNIVERSITY. 




ON THE DENSITIES OF INFINITE CONVOLUTIONS.¹


By AUREL WINTNER. 


It has been pointed out in a previous paper² that, due to certain compactness
properties of uniformly bounded functions of uniformly bounded variation,
there exists for the “term-by-term” differentiation of infinite convolutions
of distribution functions a theorem which does not assume any convergence
property of the sequence of derivatives and is, therefore, useful³ in applications.
The present note proves an essential refinement of the theorem in question by
showing that, for the “smooth” term of the infinite convolution, the existence
of an additional derivative need not be required.⁴ In fact, it will be shown that
if at least one term $\sigma_j$ of a convergent infinite convolution $\sigma_1 * \sigma_2 * \cdots$, say
the term $\sigma_j = \sigma_1$, has for $-\infty < x < +\infty$ a continuous density of bounded
variation, then so does the distribution function $\sigma$ represented by the infinite
convolution; and the continuous density of $\sigma_1 * \cdots * \sigma_n$ tends, as $n \to +\infty$,
to that of $\sigma = \sigma_1 * \sigma_2 * \cdots$ uniformly in every fixed bounded $x$-range.

Let $V(\psi)$ denote the total variation ($\leq +\infty$) of a function $\psi(x)$, where
$-\infty < x < +\infty$. Suppose that the derivatives $\phi'_n(x)$ of a sequence of uniformly
bounded, differentiable functions $\phi_n(x)$, $-\infty < x < +\infty$, are such that


(i) $\{\phi'_n(x)\}$ is equicontinuous for $-\infty < x < +\infty$;
(ii) $|\phi'_n(x)| \leq C$ for some constant $C$;
(iii) $V(\phi'_n) \leq c$ for some constant $c$.


Then⁵ the sequence $\{\phi_n(x)\}$ cannot tend to a limit function $\lim \phi_n(x)$ almost
everywhere unless $\lim \phi_n(x)$ determines for $-\infty < x < +\infty$ a continuous
function which has a uniformly continuous derivative of bounded variation,
in which case


$\phi_n(x) \to \lim_{n\to\infty} \phi_n(x)$ and $\phi'_n(x) \to \bigl(\lim_{n\to\infty} \phi_n(x)\bigr)'$


¹ Received January 28, 1937.

² A. Wintner, American Journal of Mathematics, vol. 57 (1935), pp. 363-366.

³ Cf. E. R. van Kampen and A. Wintner, ibid., vol. 59 (1937), p. 186 and p. 203.

⁴ The assumption made loc. cit.² was that $\sigma_1$ has for $-\infty < x < +\infty$ an absolutely
integrable and bounded second derivative. It is clear that this assumption implies the
existence of a continuous first derivative which has for $-\infty < x < +\infty$ a bounded
variation. Incidentally, a function of bounded variation, which is a derivative, cannot
have discontinuities.

⁵ This is a corollary of the facts proved loc. cit.,² pp. 364-365.




hold uniformly in every fixed bounded $x$-range. Now if $\phi_n = \sigma_1 * \cdots * \sigma_n$,
then $\{\phi_n(x)\}$ is uniformly bounded, since $\phi_n$ is then a distribution function.
Furthermore, $\phi_n(x) \to \sigma(x)$ holds almost everywhere, since $\sigma = \sigma_1 * \sigma_2 * \cdots$
is supposed to be a convergent infinite convolution. Hence it is sufficient to
prove that the assumption of a derivative $\sigma'_1(x)$, $-\infty < x < +\infty$, of bounded
variation implies for the finite convolutions $\phi_n = \sigma_1 * \cdots * \sigma_n$ the existence
of densities $\phi'_n = \phi'_n(x)$ which satisfy (i), (ii) and (iii), no matter what
the distribution functions $\sigma_2, \sigma_3, \cdots$ may be.


First, if $\omega = \omega(\delta)$ denotes, for a fixed $\delta > 0$, the greatest lower bound of
those numbers $\beta > 0$ which have the property that $|\sigma'_1(x^1) - \sigma'_1(x^2)| \leq \beta$ is
a consequence of $|x^1 - x^2| \leq \delta$ for arbitrary $x^1$, $x^2$, then $\omega(\delta) \to 0$ as $\delta \to 0$.
In other words, $\sigma'_1(x)$ is not only continuous but uniformly continuous for
$-\infty < x < +\infty$. In fact, $\sigma'_1(-\infty) = 0$ and $\sigma'_1(+\infty) = 0$, since the distribution
function $\sigma_1(x)$ is supposed to be such that $V(\sigma'_1) < +\infty$. This
also implies that $0 \leq \sigma'_1(x) \leq V(\sigma'_1)$ for every $x$.


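On placing $\tau_n = \sigma_2 * \cdots * \sigma_n$, it is clear that

$\phi_n(x) = \sigma_1 * \cdots * \sigma_n = \sigma_1 * \tau_n = \int_{-\infty}^{+\infty} \sigma_1(x - y)\,d\tau_n(y).$


Since $\sigma'_1$ is continuous and bounded, and since $\tau_n$ is a distribution function,
it follows that $\phi_n$ has for every $x$ a derivative $\phi'_n$ which may be obtained by
differentiation beneath the integral sign, so that

$\phi'_n(x) = \int_{-\infty}^{+\infty} \sigma'_1(x - y)\,d\tau_n(y).$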


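Now $|x^1 - x^2| \leq \delta$ implies that $|\sigma'_1(x^1 - y) - \sigma'_1(x^2 - y)| \leq \omega(\delta)$ for every
$y$, since $|x^1 - x^2| = |(x^1 - y) - (x^2 - y)|$. Hence

$|\phi'_n(x^1) - \phi'_n(x^2)| \leq \int_{-\infty}^{+\infty} \omega(\delta)\,d\tau_n(y) = \omega(\delta)$ whenever $|x^1 - x^2| \leq \delta$.

This proves (i), since $\omega(\delta) \to 0$ as $\delta \to 0$. Similarly, since $|\sigma'_1| \leq V(\sigma'_1)$,

$|\phi'_n(x)| \leq \int_{-\infty}^{+\infty} V(\sigma'_1)\,d\tau_n(y) = V(\sigma'_1),$

so that (ii) is satisfied by $C = V(\sigma'_1)$. In order to prove (iii), notice first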




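that, since $V(\sigma'_1) < +\infty$, one can write the above representation of $\phi'_n(x)$
in the form

$\phi'_n(x) = \int_{-\infty}^{+\infty} \tau_n(x - y)\,d\sigma'_1(y),$

where, as pointed out above, $\sigma'_1(-\infty) = 0$ and $\sigma'_1(+\infty) = 0$, while $\tau_n(y)$,
being a distribution function, is bounded. Hence

$\phi'_n(x_i) - \phi'_n(x_{i-1}) = \int_{-\infty}^{+\infty} \bigl(\tau_n(x_i - y) - \tau_n(x_{i-1} - y)\bigr)\,d\sigma'_1(y),$

and so, for arbitrary $m$ and $x_i$,

$\sum_{i=1}^{m} |\phi'_n(x_i) - \phi'_n(x_{i-1})| \leq \int_{-\infty}^{+\infty} \sum_{i=1}^{m} |\tau_n(x_i - y) - \tau_n(x_{i-1} - y)|\,|d\sigma'_1(y)|.$

Now let $x_0 < x_1 < \cdots < x_m$. Then, $\tau_n$ being monotone,

$\sum_{i=1}^{m} |\tau_n(x_i - y) - \tau_n(x_{i-1} - y)| = \tau_n(x_m - y) - \tau_n(x_0 - y) \leq 1$ for every $y$.

Consequently,

$\sum_{i=1}^{m} |\phi'_n(x_i) - \phi'_n(x_{i-1})| \leq \int_{-\infty}^{+\infty} |d\sigma'_1(y)| = V(\sigma'_1)$ whenever $x_0 < x_1 < \cdots < x_m$,

and so $V(\phi'_n) \leq V(\sigma'_1)$; i.e., (iii) is satisfied by $c = V(\sigma'_1)$.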
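
A discrete analogue may make the estimate for (iii) concrete. The sketch below is only an illustration and not part of the proof: it replaces the density $\sigma'_1$ by a finitely supported sequence (padded with zeros, mirroring $\sigma'_1(\pm\infty) = 0$) and the distribution $\tau_n$ by a vector of non-negative weights of total mass 1, and checks that convolving does not increase the total variation. The names are hypothetical.

```python
from fractions import Fraction


def total_variation(seq):
    """Total variation of a finitely supported sequence, zero outside its support."""
    padded = [0] + list(seq) + [0]
    return sum(abs(b - a) for a, b in zip(padded, padded[1:]))


def convolve(density, weights):
    """Full discrete convolution of two finite sequences."""
    out = [Fraction(0)] * (len(density) + len(weights) - 1)
    for i, d in enumerate(density):
        for j, w in enumerate(weights):
            out[i + j] += d * w
    return out


if __name__ == "__main__":
    density = [1, 3, 2]                                        # stands in for sigma_1'
    weights = [Fraction(1, 2), Fraction(1, 4), Fraction(1, 4)]  # probability masses
    smoothed = convolve(density, weights)
    print(total_variation(density), total_variation(smoothed))
```

The inequality follows because the differences of the convolution are the convolution of the differences, exactly as in the continuous argument above.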

It is clear from the proof that the theorem can be extended to the case 
of multi-dimensional distributions and also to the case where one requires for 
$\sigma = \sigma_1 * \sigma_2 * \cdots$ derivatives higher than the first.


THE JOHNS HOPKINS UNIVERSITY.




A REPRESENTATION OF STIELTJES INTEGRALS BY 
CONDITIONALLY CONVERGENT SERIES.* 


By FRITZ JOHN.


In an earlier paper the author expressed $\int_0^1 f(x)\,dx$, where $f(x)$ is a
function of bounded variation, by a series of the form $\sum_{\nu=1}^{\infty} a_\nu f(\lambda_\nu)$, $a_\nu$ and $\lambda_\nu$
being certain constants not depending on $f$.¹ The $a_\nu$ and $\lambda_\nu$ contained an
arbitrary rational parameter y. These expressions for the integral of a function
were generalized by H. Rademacher,² who assigned to y arbitrary algebraic
values, and also proved the validity of the expansion for all Riemann integrable
functions.

In the present paper I am giving a similar representation for Riemann-
Stieltjes integrals:


(1)  $\int_1^2 f(x)\,d\psi(x) = \sum_\nu a_\nu f(c_\nu),$


where the $a_\nu$ and $c_\nu$ are independent of $f$. The sequences $a_\nu$ and $c_\nu$ are not
uniquely determined by $\psi(x)$. One might expect that the $c_\nu$ can be prescribed
arbitrarily to a certain extent (e.g. so as to form an everywhere dense set in
the interval $1 \leq x \leq 2$) and that then coefficients $a_\nu$ depending on $\psi$ can be
determined such that (1) holds for all functions $f$ of a certain class. In this
paper a special expansion of this sort is given for the case that $f$ is of bounded
variation and that $\psi$ is continuous; the arguments $c_\nu$ have the fixed values
$\nu/2^{[\log_2 \nu]}$, independent of $\psi$. The series is in general only conditionally
convergent. By introducing $y = \psi(x)$ as variable of integration, one can obtain
new expansions $\sum a_\nu f(\lambda_\nu)$ for the Riemann integral $\int_1^2 f(t)\,dt$, the
coefficients $a_\nu$ and the arguments $\lambda_\nu$ depending on an arbitrary continuous,
monotonic function $\psi(x)$.

It would be desirable, to prove the more general theorem, that every 
continuous linear operator for functions of bounded variation f (the total 


* Received December 2, 1936. 

¹ “Identitäten zwischen dem Integral einer Funktion und unendlichen Reihen,”
Mathematische Annalen, vol. 110, pp. 718-721.

² “Some remarks on F. John’s identity,” American Journal of Mathematics, vol. 58
(1936), pp. 169-176.




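variation of $f$ taken as its norm) can be represented by a series of the form
$\sum d_\nu f(c_\nu)$.
In what follows we shall denote with $\bar{x}$ the function defined by

$\bar{x} = x/2^{[\log_2 x]}$ for $x > 0$, $\qquad \bar{x} = 1$ for $x = 0$.


THEOREM. Let $f(x)$ be of bounded variation and $\psi(x)$ be continuous in
$1 \leq x \leq 2$. Then

$\int_1^2 f(x)\,d\psi(x) = \sum_{\nu=0}^{\infty} a_\nu f(\bar\nu),$

the coefficients $a_\nu$ being defined by the following conditions:
(2)  $a_{2\nu} = \tfrac12 a_\nu + \tfrac12\bigl(\psi(\overline{2\nu - 2}) - \psi(\overline{2\nu - 1})\bigr)$ for positive integers $\nu$,
(3)  $a_\nu = \tfrac12\bigl(\psi(\bar\nu) - \psi(\overline{\nu - 1})\bigr)$ for odd positive integers $\nu$,
(4)  $a_0 = \psi(2) - \psi(1)$.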
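
The recursive definition of the coefficients is easy to evaluate exactly, which gives a concrete feel for the (slow, conditional) convergence. The sketch below is an illustration under stated assumptions: the bar operation is read as $\bar\nu = \nu/2^{[\log_2 \nu]}$ with $\bar 0 = 1$, conditions (2), (3), (4) are read as $a_{2\nu} = \tfrac12 a_\nu + \tfrac12(\psi(\overline{2\nu-2}) - \psi(\overline{2\nu-1}))$, $a_\nu = \tfrac12(\psi(\bar\nu) - \psi(\overline{\nu-1}))$ for odd $\nu$, and $a_0 = \psi(2) - \psi(1)$, and the function names are hypothetical. For $\psi(x) = x$, $f(x) = x$ the integral is $\int_1^2 x\,dx = 3/2$.

```python
from fractions import Fraction


def bar(v):
    """v-bar = v / 2**[log2 v] for v > 0, and 1 for v = 0."""
    if v == 0:
        return Fraction(1)
    return Fraction(v, 1 << (v.bit_length() - 1))


def coefficients(psi, count):
    """Exact values a_0, ..., a_count from conditions (2), (3), (4)."""
    a = [psi(Fraction(2)) - psi(Fraction(1))]            # (4)
    for v in range(1, count + 1):
        if v % 2 == 1:                                    # (3): v odd
            a.append((psi(bar(v)) - psi(bar(v - 1))) / 2)
        else:                                             # (2): v = 2 * (v // 2)
            a.append(a[v // 2] / 2 + (psi(bar(v - 2)) - psi(bar(v - 1))) / 2)
    return a


def partial_sum(f, psi, count):
    """Sum of a_v * f(v-bar) for v = 0, ..., count."""
    a = coefficients(psi, count)
    return sum(a[v] * f(bar(v)) for v in range(count + 1))


if __name__ == "__main__":
    identity = lambda x: x
    for k in (4, 8, 12):
        print(k, float(partial_sum(identity, identity, 2**k)))
```

Partial sums are taken up to a power of two only for convenience; since the series is in general only conditionally convergent, the terms must be kept in their natural order.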
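Proof. We have for $\nu = 0, 1, 2, \cdots$

(5)  $1 \leq \bar\nu < 2.$

Moreover, as $\psi(x)$ is continuous, there exists for every positive $\epsilon$ a $\delta(\epsilon)$ such
that $|\psi(x_1) - \psi(x_2)| < \epsilon$ for $1 \leq x_1 \leq 2$, $1 \leq x_2 \leq 2$ and $|x_1 - x_2| \leq \delta(\epsilon)$.

Let $\nu$ be an odd number $> 2/\delta(\epsilon) + 1$. Then $[\log_2 \nu] = [\log_2 (\nu - 1)]$
and

$|\bar\nu - \overline{\nu - 1}| = \dfrac{1}{2^{[\log_2 \nu]}} < \delta(\epsilon);$

therefore $|a_\nu| \leq \epsilon/2$.

Let $\nu$ be even and $> 2/\delta(\epsilon) + 1$. Then

$[\log_2 (\nu - 2)] = [\log_2 (\nu - 1)],$

$|\overline{\nu - 2} - \overline{\nu - 1}| = \dfrac{1}{2^{[\log_2 (\nu - 1)]}} < \delta,$

$|a_\nu| = \bigl|\tfrac12 a_{\nu/2} + \tfrac12\bigl(\psi(\overline{\nu - 2}) - \psi(\overline{\nu - 1})\bigr)\bigr| \leq \tfrac12 |a_{\nu/2}| + \epsilon/2.$

Thus for all $\nu > 2/\delta(\epsilon) + 1$

$|a_\nu| \leq \tfrac12 |a_{[\nu/2]}| + \epsilon/2.$

Consequently we have, if $M_n$ denotes the maximum of $|a_\nu|$ for $2^n \leq \nu < 2^{n+1}$,

$M_n \leq \tfrac12 M_{n-1} + \epsilon/2$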




for sufficiently large $n$; therefore $\limsup M_n \leq \epsilon$, and as $\epsilon$ was an arbitrary
positive number, $\lim M_n = 0$. Hence

(6)  $\lim_{\nu\to\infty} a_\nu = 0.$


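Let for $n = 0, 1, 2, \cdots$ and for $1 \leq x < 2$

$s_n(x) = \sum_{\nu = 2^n}^{[2^n x]} a_\nu.$

Then for positive $n$

$s_n(x) = \sum_{\nu = 2^{n-1}}^{[2^{n-1} x]} (a_{2\nu} + a_{2\nu - 1}) - a_{2^n - 1} + \eta\, a_{[2^n x]},$

where

$\eta = 1$ if $[2^n x]$ is odd, $\qquad \eta = 0$ if $[2^n x]$ is even.

Now according to (2), (3)

$a_{2\nu} + a_{2\nu - 1} = \tfrac12 a_\nu + \tfrac12\bigl(\psi(\overline{2\nu - 2}) - \psi(\overline{2\nu - 1})\bigr) + \tfrac12\bigl(\psi(\overline{2\nu - 1}) - \psi(\overline{2\nu - 2})\bigr) = \tfrac12 a_\nu.$

Hence

(7)  $s_n(x) = \tfrac12 s_{n-1}(x) - a_{2^n - 1} + \eta\, a_{[2^n x]}.$

As according to (6) $|a_{2^n - 1}|$ and $|a_{[2^n x]}|$ are less than $\epsilon/4$ for sufficiently big $n$,
we have for those $n$

$|s_n(x)| \leq \tfrac12 |s_{n-1}(x)| + \epsilon/2.$

From this we may conclude that

(8)  $\lim_{n\to\infty} s_n(x) = 0$ uniformly in $x$ for $1 \leq x < 2$.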
n->0O 


It follows moreover from (7) that

(9)  $\sum_{n=1}^{N} s_n(x) = \tfrac12 \sum_{n=0}^{N-1} s_n(x) - \sum_{n=1}^{N} a_{2^n - 1} + \sum_{n=1}^{N} \eta\, a_{[2^n x]}.$
Now forn=1 

N 

— (2% — 1) — ; 
besides for odd 

— $ (¥([2"2]) 

) 


ay. 


and for even : [2"7] 


Thus 


Substituting these expressions in (9), we obtain 


$3 dou (2) + 50(2)— + vO) — + 
Now 
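$s_0(x) = a_1 = \tfrac12\bigl(\psi(\bar 1) - \psi(\bar 0)\bigr),$
$\lim_{N\to\infty} \psi(\overline{2^N - 1}) = \psi(2),$

$\psi(\overline{[2^N x]}) = \psi([2^N x]/2^N),$

$\lim_{N\to\infty} \psi(\overline{[2^N x]}) = \psi(x)$ uniformly in $x$ for $1 \leq x < 2$. Hence using (8) it
follows that

(10)  $\sum_{n=0}^{\infty} s_n(x) = \psi(x) - \psi(2)$ uniformly in $x$ for $1 \leq x < 2$.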


Let the step function $e(y, x)$ be defined by

$e(y, x) = 1$ if $y \leq x$, $\qquad e(y, x) = 0$ if $y > x$.

Then for $1 \leq x < 2$


n n 
LY w 
p=1 


[loge n)-1 
= > > av > dy 
p=0 2llogs min (n, 2 [logs 1) g) 


[logs n]~-1 Sto ifi>z 


u=0 


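Because of (8) and (10) it follows that

(11)  $\sum_{\nu=1}^{\infty} a_\nu e(\bar\nu, x) = \psi(x) - \psi(2)$ uniformly in $x$

for $1 \leq x < 2$. As $\lim_{x\to2} e(y, x) = 1$ and $\lim_{x\to2} \psi(x) = \psi(2)$, (11) holds for
$1 \leq x \leq 2$; i.e.

(12)  $\sum_{\nu=1}^{\infty} a_\nu = 0.$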




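Let now $f(x)$ be of bounded variation in $1 \leq x \leq 2$.
We first assume that $f$ is also continuous. Then

$f(y) = -\int_1^2 e(y, x)\,df(x) + f(2).$

Consequently

$\sum_{\nu=1}^{\infty} a_\nu f(\bar\nu) = -\int_1^2 \sum_{\nu=1}^{\infty} a_\nu e(\bar\nu, x)\,df(x) + f(2) \sum_{\nu=1}^{\infty} a_\nu.$

As $\sum a_\nu e(\bar\nu, x)$ converges uniformly in $x$ and $f(x)$ is of bounded variation,
it follows using (11) and (12)

$\sum_{\nu=1}^{\infty} a_\nu f(\bar\nu) = -\int_1^2 \bigl(\psi(x) - \psi(2)\bigr)\,df(x) = \int_1^2 f(x)\,d\psi(x) + f(1)\bigl(\psi(1) - \psi(2)\bigr).$

This proves our theorem (cf. the definition of $a_0$) for the case that $f$ is
continuous.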


Let $f(x)$ be of bounded variation in $1 \leq x \leq 2$ and have discontinuities
at the points $\xi_\mu$ $(\mu = 1, 2, \cdots)$. Let


f(éu + 0) —f(éi— 9) f(&.) —f(é.— 0) = dy. 
Let moreover x(z,¥) be defined by 
9) = 4 


Then $f(x) = f_1(x) + f_2(x) + f_3(x)$, where $f_1(x)$ is continuous and of bounded
variation,


0 if ry 

fo(a) == cue (a, én), fs(2) = (Eu, 7), 


the series $\sum_{\mu=1}^{\infty} |c_\mu|$ and $\sum_{\mu=1}^{\infty} |d_\mu|$ being convergent. It is sufficient to prove our
theorem for $f = f_1$, $f = f_2$, $f = f_3$ separately. For $f = f_1$ it follows from our




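previous considerations. As (1) holds according to (11) for $f(x) = e(x, \xi_\mu)$
and $\sum_{\nu=1}^{\infty} a_\nu e(\bar\nu, \xi_\mu)$ converges uniformly in $\mu$ and $\sum c_\mu$ converges absolutely, it
follows that (1) holds as well for $f = f_2(x)$. In order to prove (1) for
$f = f_3(x)$, it is only necessary to prove that

(13)  $\sum_{\nu=1}^{\infty} a_\nu \chi(\bar\nu, \xi_\mu) = 0$

uniformly in $\mu$. But, as

$\chi(\bar\nu, x) = \lim_{h\to+0} \bigl(e(\bar\nu, x) - e(\bar\nu, x - h)\bigr),$

it follows from (11) that

$\sum_{\nu=1}^{\infty} a_\nu \chi(\bar\nu, x) = 0$

uniformly in $x$ for $1 \leq x \leq 2$.
Thus (1) is proved generally for any $f(x)$ of bounded variation.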


Remarks. If there is an $\epsilon$ such that $|f(x)| > \epsilon$ for all $x$ in $1 \leq x \leq 2$,
then the series $\sum_{\nu=0}^{\infty} a_\nu f(\bar\nu)$ is certainly not absolutely convergent, unless $\psi(x)$
is a constant.


For if that series were absolutely convergent, then $\sum |a_\nu|$ would be
convergent; consequently according to (11), (13)


y(t) —Y(2) = 2) = 2) 


v=1 vodd 
vy odd s=0 
oO 
v odd 8=0 
= e(¥,r) = 0. 
v odd p=0 


II. Let $\psi(x)$ be monotonically increasing and continuous and let
$\psi(2) = 2$, $\psi(1) = 1$. Then we obtain by substituting $t = \psi(x)$ in (1) the
identity


$\int_1^2 f(t)\,dt = \sum_{\nu=0}^{\infty} a_\nu f(\lambda_\nu)$


valid for every function $f$ of bounded variation, $a_\nu$ being defined by (2), (3),
(4) and $\lambda_\nu$ denoting $\psi(\bar\nu)$.


UNIVERSITY OF KENTUCKY. 




NOTE ON THE DEFINITION OF FIELDS BY INDEPENDENT 
POSTULATES IN TERMS OF THE INVERSE OPERATIONS.* 


By DAVID G. RABINOW.


1. Introduction. The concept of a field involves two operations which are
usually called addition and multiplication. When the definition of the field
is given in terms of these operations, we say it is defined for the direct
operations.¹ The inverse operations of these direct operations may be called
subtraction and division. It is possible to define a field in terms of these
inverse operations.² The inverse operation of multiplication, division, when
used as the fundamental operation, allows us to define multiplication in two
essentially distinct ways. The first involves the fact that 1/(1/a) = a under
certain restrictions on a. We can then define ab = a/(1/b). This is the method
used in the paper referred to in footnote 2. The second method is probably the
more fundamental in that it is analogous to the definition of division in terms
of multiplication. We shall develop the definition of a field in terms of
addition and division where our treatment of division shall be this second
method. Instead of using subtraction we shall use addition, since this will
simplify the proofs somewhat and since the use of subtraction has been
completely discussed in the paper referred to in footnote 2.
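The two ways of recovering multiplication from division can be compared on a concrete model. The sketch below is only an illustration, assuming the rational numbers with ordinary division as the operation o; the function names and the finite candidate set are hypothetical and are not part of the paper.

```python
from fractions import Fraction

def div(a, b):
    """Model of the primitive operation a o b (assumed: ordinary division)."""
    if b == 0:
        raise ZeroDivisionError("a o b is only available for b != Z")
    return Fraction(a) / Fraction(b)

def mul_via_reciprocal(a, b):
    # First method: ab = a / (1 / b), which needs b != 0.
    return div(a, div(Fraction(1), b))

def mul_via_solving(a, b, candidates):
    # Second method: ab is the element x that satisfies x o b = a.
    for x in candidates:
        if div(x, b) == a:
            return x
    return None  # no solution inside the finite candidate set

# Hypothetical finite search space standing in for the class K.
CANDIDATES = {Fraction(p, q) for p in range(-30, 31) for q in range(1, 31)}
```

In this model both functions agree wherever both are defined; the second one mirrors the way division is usually defined from multiplication, which is the route the paper takes.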


2. Postulates for a field and theorems deducible from them. Let us
consider the following set of postulates in connection with the base (K, +, o)
where K is a class of elements a, b, c, ... and +, o are binary operations.


Postulate 1. a in K and b in K imply a + b in K.

Postulate 2. If a, b, c, a + b, b + c, (a + b) + c, a + (b + c) are in
K, then (a + b) + c = a + (b + c).

Postulate 3. There exists in K at least one element Z such that a + Z = a
for all a in K.


* Received December 28, 1936. 

¹ E. V. Huntington, "Note on the definition of abstract groups and fields by sets
of independent postulates," Transactions of the American Mathematical Society, vol. 6
(1905), pp. 181-193.

² D. G. Rabinow, "Independent sets of postulates for abelian groups and fields in
terms of the inverse operations," American Journal of Mathematics, vol. 59 (1937),
pp. 211-224.



Postulate 4. For each element a in K there exists at least one element
a' in K such that a + a' = Z.

Postulate 5. a in K and b in K and b ≠ Z imply aob in K.

Postulate 6. If a, b, c, aob, aoc, (aob)oc, (aoc)ob are in K, then
(aob)oc = (aoc)ob.

Postulate 7. If a, b, c, a + b, aoc, boc, (a + b)oc and aoc + boc are in
K, then (a + b)oc = aoc + boc.

Postulate 8. a in K and b in K and a ≠ Z imply the existence of an
unique element x in K such that xoa = b.

Postulate 9. If a, b, aoa, bob are in K, then aoa = bob. 

Postulate 10. There exist at least two distinct elements in K. 
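As a rough sanity check of Postulates 6 through 9, one can test them on a finite sample of a concrete model. The sketch below assumes the rational numbers with o taken as ordinary division; the sample, the function names, and the use of b·a as the witness for Postulate 8 are illustrative assumptions, and passing on a sample proves nothing about the whole class K.

```python
from fractions import Fraction

Z = Fraction(0)
# Hypothetical finite sample of the class K.
SAMPLE = [Fraction(p, q) for p in range(-3, 4) for q in (1, 2, 3)]

def o(a, b):
    """Assumed model of the operation o: ordinary division (b != Z)."""
    return a / b

def check_postulates(sample=SAMPLE):
    nz = [b for b in sample if b != Z]
    return {
        # P6: (a o b) o c = (a o c) o b
        "P6": all(o(o(a, b), c) == o(o(a, c), b)
                  for a in sample for b in nz for c in nz),
        # P7: (a + b) o c = a o c + b o c
        "P7": all(o(a + b, c) == o(a, c) + o(b, c)
                  for a in sample for b in sample for c in nz),
        # P8 (existence only): x = b * a satisfies x o a = b
        "P8": all(o(b * a, a) == b for a in nz for b in sample),
        # P9: a o a takes a single value
        "P9": len({o(a, a) for a in nz}) == 1,
    }
```

Calling `check_postulates()` returns a dictionary with one truth value per postulate for the sampled elements.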


Note 1. If we wish we may remove the uniqueness requirement in Postu- 
late 8 and insert as an additional postulate either Lemma 2 or Lemma 3 proven 
below. However for compactness and for simplification of independence proofs, 
as well as for reasons which will become apparent in Section 4, the present 
form of Postulate 8 is desirable. 


Note 2. Throughout the subsequent work the terms Postulate, Theorem, 
Lemma and Definition will be referred to respectively by P, T, L, and D. 
Those theorems deducible from Postulates 1 through 4 shall be assumed as 


known and will be referred to as G. 
Lemma 1. If b ≠ Z, then Zob = Z.

Let a be any element in K. Hence since b ≠ Z, we have by P5, P3, and P7,
aob = (a + Z)ob = aob + Zob. Whence by G, Zob = Z.

Lemma 2. If a ≠ Z and b ≠ Z, then aob ≠ Z.

Since b ≠ Z, there exists by P8 an unique element x such that xob = Z. Hence
by L1, x must be Z. If aob = Z, then a must be Z. But this is a contradiction.
Hence aob ≠ Z.

Lemma 3. If a, b, c, aoc, boc are in K and if c ≠ Z and if aoc = boc,
then a = b.

By P4 there exists an element a' such that a + a' = Z. By P5 a'oc is in K.
Hence by P1 a'oc + boc = a'oc + aoc. Whence by P7 (a' + b)oc = (a + a')oc
= Zoc by P4. Hence by L2, a' + b = Z. Therefore by G a = b.

At this stage we are in the position to define: 


Definition 1. There exists an unique element U = aoa ≠ Z. For, by
P10 there exists in K at least one element a ≠ Z. Hence by P5 aoa is in K
and by L2 aoa ≠ Z. By P9 this defines the unique element U = aoa = bob.




Lemma 4. If under the conditions of P8 the element b is also not equal
to Z, then xob = a.

For, xoa = b. Hence by P5 (xoa)ob = bob = U by D1. Whence by P6
(xob)oa = U = aoa since a ≠ Z. Therefore by L3 xob = a.
We can now define the product (written ab) of any two elements a and b 


as follows: 


Definition 2.1. If b = Z, then ab = Z.


Definition 2.2. If b ≠ Z, then ab shall be the element x of P8, that is the
element x satisfying the equation xob = a. From these definitions we have


immediately 
THEOREM 1. ain K and b in K imply ab in K. 
THEOREM 2. If a, b, ab, ba are in K, then ab = ba. 


Case I. b = Z, then by D2.1 ab = Z. If a = Z, then by D2.1 ba = Z.
If a ≠ Z, then by D2.2 and L2 ba = Z.


Case II. a ≠ Z, b ≠ Z. By D2.2 ab = x where xob = a. By D2.2
ba = y where yoa = b and by L4 yob = a. Hence by L3 x = y.

THEOREM 3. If a, b, c, ab, bc, (ab)c, a(bc) are in K, then (ab)c = a(bc).


Case I. If a=Z or b=Z or c=Z, the theorem follows as in Case I 
of T2. 


Case II. a ≠ Z, b ≠ Z, c ≠ Z. By D2.2 let (ab)c = w where woc = ab.
Hence by D2.2 (woc)ob = a. Similarly let p = a(bc) = (bc)a by T2 where
poa = be and hence (poa)oc =b by D2.2. By P6 b = (poa)oc = (poc)oa. 
Hence by L4 (poc)ob =a. Therefore (woc)ob = (poc)ob and by repeated 
application of L3 p = w. 
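Definitions 2.1 and 2.2 and Theorems 1 through 3 can be tried out on a small finite model. The sketch below is an illustration only: it assumes K is the set of residues modulo 7 and that a o b is a times the inverse of b modulo 7; the names `o` and `product` are hypothetical. It needs Python 3.8 or later for the modular inverse via `pow`.

```python
P_MOD = 7            # assumed model: the field of residues mod 7
K = range(P_MOD)
Z = 0

def o(a, b):
    """Assumed model of a o b: a times the inverse of b mod 7 (b != Z)."""
    if b == Z:
        raise ValueError("a o b is not postulated for b = Z")
    return (a * pow(b, -1, P_MOD)) % P_MOD

def product(a, b):
    """Product ab built from o alone, following D2.1 and D2.2."""
    if b == Z:
        return Z                                  # D2.1
    solutions = [x for x in K if o(x, b) == a]    # D2.2: solve x o b = a
    assert len(solutions) == 1                    # uniqueness, as in P8
    return solutions[0]

def product_is_commutative_and_associative():
    commutative = all(product(a, b) == product(b, a) for a in K for b in K)
    associative = all(product(product(a, b), c) == product(a, product(b, c))
                      for a in K for b in K for c in K)
    return commutative and associative
```

In this model `product` coincides with ordinary multiplication modulo 7, which is what Theorem 7 leads one to expect.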


THEOREM 4. If a, b, c, a + b, ac, bc, (a + b)c and ac + bc are in K,
then (a + b)c = ac + bc.

Case I. c = Z. By D2.1 (a + b)c = ac = bc = Z; whence by P3
(a + b)c = ac + bc.


Case II. c ≠ Z. By D2.2 let (a + b)c = p where poc = a + b. Also
let ac = x where xoc = a and let bc = y where yoc = b. Then by P1
a + b = xoc + yoc = (x + y)oc by P7. Hence by L3 p = x + y.


THEOREM 5. If a, b,c, a+b, ca, cb, c(a +b) and ca+ cb are in K, 
then c(a +b) =ca+ cb. 




This theorem follows immediately from T4 and T2. 


THEOREM 6. There exists an unique element u ≠ Z such that au = ua = a
for all a in K.


Consider the element U ≠ Z defined in D1.
Case I. a = Z. By D2.1 Ua = Z and by T2 Ua = aU.


Case II. a ≠ Z. Let Ua = p where poa = U = aoa by D2.2 and D1.
Hence by L3 p = a. By T2 aU = Ua. Now suppose there exists another U'
such that aU' = U'a = a. From U'a = a we have by D2.2 aoa = U'. But
aoa = U. Hence U = U', and U is unique.


THEOREM 7. For any elements a and b in K where a ≠ Z, there exists
an unique element x such that xa = ax = b.


Let x = boa. Then by D2.2 p = xa = (boa)a where poa = boa. Hence by L3
p = b and by T2 ax = xa. Now suppose there exists another element x' such
that ax' = x'a = b. Then by D2.2 x' = boa. Hence x' = x.


THEOREM 8. If a, b, a + b, b + a are in K, then a + b = b + a.

Let d be any element of K ≠ Z and let D be the element x of T7 such that
dD = U. By T4 and T5

(a + b)(d + d) = a(d + d) + b(d + d) = ad + ad + bd + bd.

Likewise by T4 and T5

(a + b)(d + d) = (a + b)d + (a + b)d = ad + bd + ad + bd.

Therefore

ad + ad + bd + bd = ad + bd + ad + bd.

Hence by P1, P2, P3, and P4 ad + bd = bd + ad or by T5 (a + b)d = (b + a)d.
Multiplying by D and using T1, T3, T7 and T6 we have a + b = b + a.

But P1, P2, P3, P4, T1, T2, T3, T4, T5, T6, T7, T8 are the postulates
for a field in terms of the direct operations of addition and multiplication
(Huntington). Hence any system (K, +, o), which satisfies Postulates 1
through 10, is a field with respect to the direct operations of addition and
multiplication. Furthermore from Theorem 7 we see immediately that the
operation o is the inverse operation of multiplication. To complete the proof
that our set of postulates is both a necessary and sufficient set to define a field
we must show that from P1, P2, P3, P4, T1, T2, T3, T4, T5, T6, T7, T8
we can deduce P5, P6, P7, P8, P9, P10. For this purpose we define the
operation o as follows:




Definition 3. If a is in K and b is in K and if a ≠ Z, then x = boa
shall be the element x in T7 satisfying xa = b.


LEMMA 5. If a, b, c, ac, bc are in K and if c ≠ Z and if ac = bc,
then a = b.


For, by T7 the element c' exists such that cc' = U. Then by T1 (ac)c' = (bc)c'
or by T4, T7, T6 a = b.


THEOREM 9. a in K, b in K and b ≠ Z imply aob in K. (P5)
Follows immediately from D3.


THEOREM 10. There exist at least two distinct elements in K. (P10)
These are the elements Z and u whose existence is postulated in P3 and T6.


THEOREM 11. a, b, aoa, bob in K imply aoa = bob. (P9)
By T6 ua = a for all a. If a ≠ Z, then by D3 u = aoa for all a for which
aoa is in K.


THEOREM 12. a in K, b in K, and a ≠ Z imply the existence of an
unique element x in K such that xoa = b. (P8)


Take x = ba. Then by D3 xoa = b if a ≠ Z. Now suppose there exists
another element x' such that x'oa = b. Then by D3, x' = ba. Hence x' = x.


THEOREM 13. If a, b, c, a + b, (a + b)oc, aoc, boc, aoc + boc are in K,
then (a + b)oc = aoc + boc. (P7)


By P1 and T9 (a + b)oc, aoc, boc are in K if c ≠ Z. By T7 there exists an
element x such that xc = a + b. Also by T7 there exist elements y and w
such that yc = a and wc = b. Hence xc = yc + wc = (y + w)c by
P1 and T4. Therefore by L5 x = y + w and the theorem follows by D3.


THEOREM 14. Jf a, b, c, aob, aoc, (aob)oc, (aoc)ob are in K, then 
(aob)oc = (aoc)ob. (P6) 


Case I. a = Z. By D3 (aob)oc = (aoc)ob = Z by T7.


Case II. a ≠ Z. By T9 aob, aoc, (aob)oc, (aoc)ob are in K if b ≠ Z
and c ≠ Z. Let (aob)oc = x and (aoc)ob = w. By D3 xc = aob. Take
aob = y where by D3 yb = a. Likewise wb = aoc and aoc = p where pc = a.
Since xc = y, then xcb = yb = a and since wb = p, then wbc = pc = a. Hence
(xc)b = (wb)c = (wc)b by T1, T2, T3. Hence by repeated applications of
L5 x = w.


From the above we conclude that the set of Postulates 1 through 10 is 
both necessary and sufficient to define a field. 




3. Independence of the postulates. The postulates are examined for
independence by exhibiting examples of systems (K, +, o) which fail to
satisfy the correspondingly numbered postulates but satisfy the remaining
postulates.


Example 1) K is the class of two elements 0, 1 with a + b and aob
satisfying the following multiplication tables. The
elements u, r, and s are elements not in K.

[Tables for a + b and aob: entries not recoverable from the source.]

Example 2) K is the class of all rational numbers, positive, negative,
and zero. a + b = a + 2b. aob = a/b.

Example 3) K is the class of all positive rational numbers. a + b
= a + b. aob = a/b.

Example 4) K is the class of all positive rational numbers including
zero. a + b = a + b. aob = a/b.

Example 5) K is the class of all integers, positive, negative, and zero.
a + b = a + b. aob = a/b.

Example 6) K is the class of hypercomplex numbers of the form
x1 + wi + pj where x, w, p are rational numbers,
positive, negative, and zero. a + b = a + b.


1 
+ p2” +(w2—pe2)? 
X (m1 + + pif) (21 + + p2))» 


where the product of the coefficients shall be the 
ordinary product of rational numbers and the “ units” 
shall follow the table. 


aob = 


1/1 —i —j 
i ji 1 —1 


Example 7) K is the class of all integers, positive, negative and zero.
a + b = a + b. aob = a − b.

Example 8) K is the class of all integers, positive, negative and zero.
a + b = a + b. aob = 0.




Example 9) K is the class of all rational numbers, positive, negative
and zero. a + b = a + b. aob = ab.

Example 10) K is the class consisting of the element 0 only. a + b
= a + b. aob is undefined.
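Two of the independence examples are easy to probe numerically. The following sketch, written under the assumption that Example 7 uses aob = a − b on the integers and Example 9 uses aob = ab on the rationals, tests Postulates 6, 7 and 9 on small hypothetical samples. A failure on the sample refutes the postulate; a pass only fails to refute it.

```python
from fractions import Fraction

def holds_P6(o, xs):
    nz = [x for x in xs if x != 0]
    return all(o(o(a, b), c) == o(o(a, c), b)
               for a in xs for b in nz for c in nz)

def holds_P7(o, xs):
    nz = [x for x in xs if x != 0]
    return all(o(a + b, c) == o(a, c) + o(b, c)
               for a in xs for b in xs for c in nz)

def holds_P9(o, xs):
    # a o a should take one and the same value for every a != Z
    return len({o(a, a) for a in xs if a != 0}) == 1

# Hypothetical finite samples of the two classes K.
INTEGERS = list(range(-3, 4))
RATIONALS = [Fraction(p, q) for p in range(-3, 4) for q in (1, 2)]

def example7(a, b):
    return a - b      # assumed reading of Example 7

def example9(a, b):
    return a * b      # assumed reading of Example 9
```

On these samples Example 7 violates only Postulate 7 among the three tested, and Example 9 violates only Postulate 9, which matches their roles in the independence argument.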


4. The concept of operational invariance. Let us consider the Postulates
1 through 8 inclusive. It is to be observed that the commutative
postulate, that is aob = boa, cannot be deduced from these postulates. An
independence example for this postulate would be: K is the class of all rational
numbers, positive, negative and zero. a + b = a + b and aob = a. We note
further that if the commutative postulate is added to the set of Postulates 1
through 8, we obtain the definition of a field in terms of the direct operations,
provided that we define aoZ = Z. In other words we have found a set of
postulates (P1 through P8) which defines a field in terms of either the direct
or the inverse operations depending on what additional postulates we desire
to add to it. This naturally suggests the following problem: Suppose we
consider the set P1 through P8 as a distinct set of postulates involving the
operations + and o. Let us further consider the inverse operation of o, which
may be defined by means of P8. Call this operation X. Replace o, wherever
it occurs in the set P1 through P8, by X. We now have a new set P1'
through P8'. The problem is: can we from P1 through P8 deduce P1' through
P8'? The purpose of this section of the paper is to prove that this is true.
Any system (K, +, o), which has this property of replacing o by X, is to be
defined as an operationally invariant system with respect to the operation o.
It is clear that a field does not have this property but that there exists a subset
of the postulates of the field, namely P1 through P8, which does. (This
concept of operational invariance may obviously be extended to any type of
system in which an inverse may be defined.) By P8 there exists a unique
element x such that xob = a if b ≠ Z. This enables us to make the following
definition:

Definition 4. x = aXb if x is the element satisfying P8 when b ≠ Z,
that is, if x is the element such that xob = a. This proves


THEOREM 15. a in K, b in K and b ≠ Z imply aXb in K.


THEOREM 16. If a, b, c, a + b, (a + b)Xc, aXc, bXc and aXc + bXc
are in K, then (a + b)Xc = aXc + bXc.


By T15 aXc, bXc and (a + b)Xc are in K if c ≠ Z. Let (a + b)Xc = w
where by D4 woc = a + b. Also let aXc = p and bXc = q where again
by D4 poc = a and qoc = b. Hence by P1 poc + qoc = a + b or by P7
(p + q)oc = woc. Hence p + q = w by L3. (It is to be noted that L1, L2
and L3 are still true since their proofs depended only on P1 through P8.)


THEOREM 17. a in K and b in K and a ≠ Z imply the existence of an
unique element w such that wXa = b.


Take w = boa. Since a ≠ Z, then by T15 (boa)Xa = r where r is in K.
Hence by D4 roa = boa. Whence by L3 r = b. Furthermore the element w
must be unique since by D4 w = boa which by P5 is uniquely determined
by a and b.


Lemma 6. Jf a, b, aXb and (aXb)ob are in K, then (aXb)ob =a. 
Let w=aXb. By D4 this means wob = a. 


THEOREM 18. If a, b, c, aXb, aXc, (aXb)Xc and (aXc)Xb are in K,
then (aXb)Xc = (aXc)Xb.


To satisfy the hypothesis of the theorem we see from D4 that b ≠ Z and c ≠ Z.
Take (aXb)Xc = w where woc = aXb by D4. Take also (aXc)Xb = p where
pob = aXc by D4. Hence by P5 (woc)ob = (aXb)ob and (pob)oc = (aXc)oc.
By L6 (aXb)ob = a and (aXc)oc = a. Hence (woc)ob = (poc)ob. Whence
by L3 p = w.
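Theorems 16 through 18 say that the operation X obtained from o by Definition 4 again satisfies the analogues of Postulates 6, 7 and 8. A small sketch, assuming the model of residues modulo 7 with a o b equal to a times the inverse of b, makes this concrete; all names here are hypothetical and the check covers only this one model. It needs Python 3.8 or later for the modular inverse via `pow`.

```python
P_MOD = 7            # assumed model: residues mod 7
K = range(P_MOD)
Z = 0

def o(a, b):
    """Assumed model of a o b (b != Z): a times the inverse of b mod 7."""
    if b == Z:
        raise ValueError("a o b is not postulated for b = Z")
    return (a * pow(b, -1, P_MOD)) % P_MOD

def X(a, b):
    """Definition 4: a X b is the unique x with x o b = a (b != Z)."""
    if b == Z:
        raise ValueError("a X b is not defined for b = Z")
    solutions = [x for x in K if o(x, b) == a]
    assert len(solutions) == 1
    return solutions[0]

def check_invariance():
    nz = [b for b in K if b != Z]
    t16 = all(X((a + b) % P_MOD, c) == (X(a, c) + X(b, c)) % P_MOD
              for a in K for b in K for c in nz)
    t17 = all(sum(1 for w in K if X(w, a) == b) == 1
              for a in nz for b in K)
    t18 = all(X(X(a, b), c) == X(X(a, c), b)
              for a in K for b in nz for c in nz)
    return t16, t17, t18
```

In this model X turns out to be multiplication modulo 7, so the check also illustrates the remark in Section 5 that both multiplication and division satisfy the set P1 through P8.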


5. Consistency of the postulates. To show that the set of postulates for 
a field is consistent we exhibit the set of all rational numbers with ordinary 
addition and division as our operations. To show the consistency of the set 
of postulates considered in section 4 we may take the set of all rational numbers 
with ordinary addition and either ordinary multiplication or ordinary division. 


HARVARD UNIVERSITY. 


A CORRECTION. 


In my paper referred to in footnote 2, one of the postulates is incorrectly stated 
and several of the independence examples need to be restated. 

On page 215, Postulate 18.2 should be: If a* exists, then a* ≠ Z (provided a ≠ Z,
U ≠ Z).

On page 223, Example 12 should be: K is the class of all positive rational numbers
excluding zero. a − b = a + b. aob = a/b.

On page 223, Example 13 should have a − b = |a − b|.

On page 223, Example 17 should be: K is the class of all rational numbers, positive,
negative, and zero. a − b = a − b. aob = ab.

On page 223, Example 18.2 should have aob = a/b except 1/a = 0, but 1/1 = 1.




THE REPRESENTATION OF INTEGERS AS SUMS OF VALUES 
OF CUBIC POLYNOMIALS. II.* 


By R. D. James.


1. Introduction. In a previous paper under the same title¹ the author
proved the following result.


THEOREM 1. Let $s$ be an integer $\geq 9$ and let $P(x)$ be a polynomial of
the form

(1.1) $P(x) = a(x^3 - x)/6 + b(x^2 - x)/2 + cx,$

where $a, b, c$ are integers without a common factor, and $a \not\equiv 4c \pmod 8$.
Then every sufficiently large integer is a sum of nine values of $P(x)$.


The condition $a \not\equiv 4c \pmod 8$ was an artificial one which could not be
removed at the time. In the present paper we shall show that Theorem 1 is
true without the restriction $a \not\equiv 4c \pmod 8$. The method of proof was
suggested by two recent papers by L. K. Hua.² The new idea introduced by him
may be explained briefly in the following way. If $\Phi(x)$ is a polynomial of
degree $k$ with integral coefficients, an integer $\theta$ was defined in § 2, I to be the
highest power of a prime $p$ which divided every coefficient of $\Phi'(x)$. Hua
defines $\theta$ to be the highest power of a prime $p$ for which $p^\theta \mid \Phi'(x)$ for all
integers $x$. It may be shown that the two definitions are equivalent in the case
of cubic polynomials except when $p = 2$. It is this difference when $p = 2$
which enables us to avoid the restriction $a \not\equiv 4c \pmod 8$ in Theorem 1.

The results obtained by Hua in the second of the papers to which reference
was made above are correct, but there is an error at the beginning of § 16,
page 45. The proofs which he gives are therefore not complete. He makes
the following statement: "Let $\theta$ be the highest power of a prime $p$ such
that $\Phi'(h) \equiv 0 \pmod{p^\theta}$ for all integers $h$." The $\Phi(h)$ which he is using
is an integral-valued polynomial and not necessarily one with integral
coefficients. Hence $\Phi'(h)$ need not be an integer and the congruence
$\Phi'(h) \equiv 0 \pmod{p^\theta}$ is meaningless. In Lemmas 1 and 2 we shall show how to
avoid this difficulty.


* Received February 18, 1937.
¹ American Journal of Mathematics, vol. 56 (1934), pp. 303-315. This paper will
be referred to as I.
² American Journal of Mathematics, vol. 58 (1936), pp. 553-562; Journal of the
Chinese Mathematical Society, vol. 1 (1936), pp. 23-61.


2. The proof of Hua’s results. We first introduce the notation to be 
used. Let 


+1): 
j! 


where the a; are integers. Let d be the least common multiple of the de- 


nominators of 


If the canonical product of an integer $n$ is $n = p_1^{a_1} \cdots p_r^{a_r}$ and if $p_i^{t_i} \mid d$,
$p_i^{t_i+1} \nmid d$ for $i = 1, 2, \ldots, r$, we define $n^*$ by the equation $n^* = p_1^{t_1+a_1} \cdots p_r^{t_r+a_r}$.

Let $\Phi(x) = d\,P(x)$ so that $\Phi(x)$ is a polynomial with integral coefficients.
For every prime $p$ let $\theta$ be the highest power of $p$ for which $\Phi'(x) \equiv 0 \pmod{p^\theta}$
for every integer $x$. Let $P_0(x) = p^{-\theta}\Phi'(x)$. Let $M(m)$ denote the number
of solutions of


(2.1) $\sum_{\nu=1}^{s} P(x_\nu) \equiv n \pmod m, \quad 0 \leq x_\nu < m^*.$


For $m = p^l$ let $N(p^l)$ denote the number of solutions of (2.1) in which not
every $P_0(x_\nu)$ is divisible by $p$.

Lemma 1. (Hua, Lemma 35). If $d = p^tD$, where $(p, D) = 1$,
$l \geq \max(2\theta + 2 - t, \theta + 2)$ and $x = y + dzp^{l-\theta-1}$, then


$P(x) \equiv P(y) + zp^{l-1}P_0(y) \pmod{p^l},$
$P_0(x) \equiv P_0(y) \pmod p.$


Proof. If we expand $\Phi(x) = \Phi(y + dzp^{l-\theta-1})$ by Taylor's Theorem we
obtain


$\Phi(x) = \Phi(y) + \sum_{j=1}^{k} \frac{(dzp^{l-\theta-1})^j}{j!}\,\Phi^{(j)}(y).$


Since (¢ 2t+1 we have 
$\Phi(x) \equiv \Phi(y) + dzp^{l-1}P_0(y) \pmod{p^{l+t}},$
$dP(x) \equiv dP(y) + zDp^{l+t-1}P_0(y) \pmod{p^{l+t}},$
$p^tP(x) \equiv p^tP(y) + zp^{l+t-1}P_0(y) \pmod{p^{l+t}},$
$P(x) \equiv P(y) + zp^{l-1}P_0(y) \pmod{p^l}.$


This proves the first result of the lemma and the second follows in a similar way. 
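The two congruences of Lemma 1 can be spot-checked on a single cubic. The sketch below assumes the particular choice a = 4, b = 6, c = 1 in the form (1.1) and the prime p = 2, for which d = 3, t = 0, θ = 2 and the smallest admissible exponent is l = 6; these values and the function names are assumptions made for the illustration, not data from the paper.

```python
def P(x, a=4, b=6, c=1):
    """P(x) = a(x^3 - x)/6 + b(x^2 - x)/2 + cx for the assumed a, b, c."""
    # x^3 - x is divisible by 6 and x^2 - x by 2, so the divisions are exact.
    return a * (x**3 - x) // 6 + b * (x * x - x) // 2 + c * x

def P0(x):
    """P_0(x) = Phi'(x) / 2^theta with Phi = 3P, Phi'(x) = 6x^2 + 18x - 8, theta = 2."""
    return (6 * x * x + 18 * x - 8) // 4

def lemma1_holds(y, z, p=2, l=6, d=3, theta=2):
    x = y + d * z * p ** (l - theta - 1)
    first = (P(x) - P(y) - z * p ** (l - 1) * P0(y)) % p**l == 0
    second = (P0(x) - P0(y)) % p == 0
    return first and second
```

For example, `lemma1_holds(2, 1)` compares P(26) with P(2) + 32·P0(2) modulo 64.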




Lemma 2. (Hua, Lemma 36). Jf /=max(26+2—t, 0+2) then 
N(p') (p"*). 


Proof. The argument is very similar to that used in the proof of 
Lemma 3,1. If d= ptD where (p, D) =1, then p’* = p**! so that N(p’) is 
the number of solutions of 


(2. 21) P(2) (mod p'), OS < p**!, ppevery Po(av). 
y= 


Hence N(p*) is equal to D-* times the number of solutions of 
(2. 22) » P(zv) =n (mod p'), OS ay < Dp**!, pfevery Po(av). 
y=1 


For every av in (2. 22) let zy = [2,/(Dp**'-**)] so that 


0 = Yv < er, 


Ly == Yv 0 < av< 


Then by Lemma 1 we may write (2. 22) in the form 


> P(y») + evPo(yr) =n (mod 
0S < Dp", 0S < p™, pfevery Po(yv). 


It follows that to each solution of (2.22) there corresponds a solution of the 
two congruences 


(2. 23) P(y) =n (mod p'), 0S < p}every Po(y), 


(2.24) =p P(yv)) (mod p), OS a < p™. 


By the same method of proof as that used in Lemma 3, I, it can be shown that 
(2.23) has D*p-**N(p') solutions and that (2.24) has p**** solutions. 
Hence (2. 22) has solutions. Then 


and this completes the proof. 


3. The proof of Theorem 1 when $a \equiv 4c \pmod 8$. As explained in I,
Theorem 1 is a consequence of Theorem 3, I, and this theorem in turn depends 
upon the fact that 


(3. 11) is) ere, l=y, 9, 


where $y$ is some fixed integer. If the proof of (3.11) as presented in I is
examined, it is found that the restriction $a \not\equiv 4c \pmod 8$ was used only when
$b \equiv 6c \pmod 8$, $c$ odd. This was in (6.41). Hence in this section we shall
prove that (3.11) is true when $P(x)$ has the form (1.1) with $a \equiv 4c$,
$b \equiv 6c \pmod 8$ and $c$ odd.

In sections 4 and 5 of I we have shown that the number of solutions of³


(3. 12) > P(vay + t) =n (mod p’), 0= a < p' 
p=1 


is = ps) when p= 3. Since p'* = p' when P(z) is a cubic polynomial 
and p > 3, it is evident that the number of solutions of (3.12) is the same 
as the number of solutions of 


(3. 13) > Py) =n (mod p’), 0S < p*™. 


The congruence (3.13) has M(p") solutions by definition. Hence we have 
M(p') = pp )) when p > 3, and this proves (3.11) when p>3. Ina 
similar manner it can be shown that (3.11) is true with y= 2 when p=3 
and 3|a, for in this case v = 1. | 

Thus two cases, $p = 3$, $3 \nmid a$ and $p = 2$, remain. We shall dispose of
them in Lemmas 3 and 4.


Lemma 3. If $p = 3$, $3 \nmid a$ then (3.11) is true with $y = 1$.

Proof. In this case we have $d = 3$, $3^* = 9$, $\Phi(x) = 3P(x)$,

$\Phi'(x) = 3a(x^2 + x)/2 + (6b - 3a)x/2 - (a + 3b - 6c)/2.$


From this equation it is evident that $3 \nmid \Phi'(x)$ for all values of $x$ since $3 \nmid a$.
Hence $\theta = 0$ and $P_0(x) = \Phi'(x)$.

We distinguish two cases. 1). Suppose $3 \mid b$. Since

$P(x + 1) + 2P(9 - x) + P(x - 1) \equiv ax \pmod 3$

there exists a solution $x_0$ of the congruence

$P(x_0 + 1) + 2P(9 - x_0) + P(x_0 - 1) \equiv n \pmod 3.$


Hence the congruence


$\sum_{\nu=1}^{5} P(x_\nu) \equiv n \pmod 3, \quad 0 \leq x_\nu < 3^*$


³ See (1.41)-(1.44) of I for the definition of $v$ and $t$.




has the solution $x_1 = 0$, $x_2 = x_0 + 1$, $x_3 = x_4 = 9 - x_0$, $x_5 = x_0 - 1$. Also,
$P_0(0) = -(a + 3b - 6c)/2$ and this expression is not divisible by 3. Hence
for $s \geq 5$, $l \geq 2 = \max(2\theta + 2 - t, \theta + 2)$, it follows from Lemma 2 that


This proves the lemma when $3 \mid b$.
2). Suppose $3 \nmid b$. In this case we have


$P(x) + P(9 - x) \equiv bx^2 \pmod 3$


and so there is a solution of the congruence⁴


$\sum_{\nu=1}^{3} [P(x_\nu) + P(9 - x_\nu)] \equiv n \pmod 3.$

As before we have a solution $y_1 = 0$, $y_{2\nu} = x_\nu$, $y_{2\nu+1} = 9 - x_\nu$, $\nu = 1, 2, 3$,
of the congruence


$\sum_{\nu=1}^{7} P(y_\nu) \equiv n \pmod 3, \quad 0 \leq y_\nu < 3^*,$


in which $P_0(0)$ is not divisible by 3. Thus (3.2) is proved in this case also
with $s \geq 7$.
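The two congruences modulo 3 used in the proof of Lemma 3 are polynomial identities and can be checked by direct evaluation. The sketch below evaluates the cubic (1.1) for sample coefficients; the function names and the sampled ranges are assumptions made for the illustration.

```python
def P(x, a, b, c):
    """P(x) = a(x^3 - x)/6 + b(x^2 - x)/2 + cx; both divisions are exact."""
    return a * (x**3 - x) // 6 + b * (x * x - x) // 2 + c * x

def case1_residue(x, a, b, c):
    # Case 1 (3 | b): P(x+1) + 2P(9-x) + P(x-1) - ax should vanish mod 3.
    return (P(x + 1, a, b, c) + 2 * P(9 - x, a, b, c)
            + P(x - 1, a, b, c) - a * x) % 3

def case2_residue(x, a, b, c):
    # Case 2: P(x) + P(9-x) - b x^2 should vanish mod 3.
    return (P(x, a, b, c) + P(9 - x, a, b, c) - b * x * x) % 3
```

A zero residue for every sampled x is what the proof relies on; since a is prime to 3, the linear congruence ax ≡ n (mod 3) in Case 1 is then solvable for every n.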


Lemma 4. If p=2 then (3.11) is true with 
Proof. In this case we have $d = 1$ or $3$,


$\Phi(x) = da(x^3 - x)/6 + db(x^2 - x)/2 + dcx,$
$\Phi'(x) = da(x^2 + x)/2 + d(2b - a)x/2 - d(a + 3b - 6c)/6.$


Using the relations $a \equiv 4c$, $b \equiv 6c \pmod 8$ it is easily seen that $\Phi'(x) \equiv 0 \pmod 4$
for all values of $x$, but that $\Phi'(x) \equiv 0 \pmod 8$ does not hold for all values of $x$.
Hence $\theta = 2$.
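The difference between the two definitions of θ at p = 2 can be seen numerically. The sketch below computes, for given a, b, c, the integer coefficients of Φ'(x) and then the exponent of 2 dividing all coefficients (the definition of I) and the exponent dividing all values Φ'(x) on a sampled range (Hua's definition). The helper names, the sampled range of x, and the cap on the exponent are assumptions made for this illustration.

```python
from fractions import Fraction
from math import gcd

def phi_prime_coeffs(a, b, c):
    """Return d and the integer coefficients (k2, k1, k0) of Phi'(x)."""
    # P(x) = (a/6) x^3 + (b/2) x^2 + (c - a/6 - b/2) x
    coeffs = [Fraction(a, 6), Fraction(b, 2),
              Fraction(c) - Fraction(a, 6) - Fraction(b, 2)]
    d = 1
    for q in coeffs:                       # d = lcm of the denominators
        d = d * q.denominator // gcd(d, q.denominator)
    c3, c2, c1 = (int(d * q) for q in coeffs)
    return d, (3 * c3, 2 * c2, c1)

def exponent_dividing_all(values, p):
    """Largest e (capped at 32) such that p^e divides every value."""
    e = 0
    while e < 32 and all(v % p ** (e + 1) == 0 for v in values):
        e += 1
    return e

def both_thetas(a, b, c, p=2):
    d, (k2, k1, k0) = phi_prime_coeffs(a, b, c)
    by_coefficients = exponent_dividing_all([k2, k1, k0], p)
    by_values = exponent_dividing_all(
        [k2 * x * x + k1 * x + k0 for x in range(64)], p)
    return d, by_coefficients, by_values
```

For a = 4, b = 6, c = 1 one finds Φ'(x) = 6x² + 18x − 8, whose coefficients share only a single factor 2 although every value is divisible by 4.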

If $n$ is even let $x_0 = 0$ or $1$ according as $n \equiv 2$ or $n \equiv 0 \pmod 4$. Then
since $P(0) = 0$ and $P(1) = c$ is odd, we have $n - 2P(x_0) \equiv 2 \pmod 4$.
If $n$ is odd let $r = 1$ or $3$ according as $n \equiv 3c$ or $n \equiv c \pmod 4$. Then we
have $n - rP(1) \equiv 2 \pmod 4$. This shows that we can always write $n$ in
the form

$n = rP(x_0) + 2n_1,$

where $r = 1, 2,$ or $3$, and $n_1$ is odd.
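The choice of r and x_0 described above is a short case distinction, sketched here under the stated facts P(0) = 0 and P(1) = c with c odd. The function name `split` and its return convention are hypothetical.

```python
def split(n, c):
    """Return (r, x0, n1) with n = r*P(x0) + 2*n1, where P(0) = 0 and P(1) = c (c odd)."""
    if n % 2 == 0:
        r = 2
        x0 = 0 if n % 4 == 2 else 1
    else:
        x0 = 1
        r = 1 if (n - 3 * c) % 4 == 0 else 3
    p_value = 0 if x0 == 0 else c           # P(0) = 0, P(1) = c
    n1, remainder = divmod(n - r * p_value, 2)
    assert remainder == 0                   # n - r*P(x0) is even by construction
    return r, x0, n1
```

Because n − rP(x_0) is congruent to 2 modulo 4 in every branch, the returned n_1 is odd.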


⁴ E. Landau, Vorlesungen über Zahlentheorie, Bd. I, Theorem 301.


Now $P(x) + P(32 - x) \equiv bx^2 \pmod{32}$ and since $b/2$ is odd there
exists a solution of the congruence⁵


$\sum_{\nu} (b/2)x_\nu^2 \equiv n_1 \pmod{16}$


in which $x_1$ is odd. It follows that


$\sum_{\nu} [P(x_\nu) + P(32 - x_\nu)] \equiv 2n_1 \pmod{32}, \quad 0 \leq x_\nu < 16$


and hence that


$rP(x_0) + \sum_{\nu} [P(x_\nu) + P(32 - x_\nu)] \equiv n \pmod{32}.$


Moreover at least one of $P_0(x_1)$ and $P_0(32 - x_1)$ is not divisible by 2. For
if both were divisible by 2 we should have


$dbx_1/2 \equiv P_0(x_1) - P_0(32 - x_1) \equiv 0 \pmod 2,$

whereas $d$, $b/2$, and $x_1$ are all odd. Then for


$s \geq 9 \geq r + 6, \quad l \geq 6 = \max(2\theta + 2 - t, \theta + 2),$
we have

M(2") = N(2") 28-1 N (2-1) (e-1) (32) = Q(1-5) (8-1) | 


This completes the proof for the case p = 2. 
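The congruence P(x) + P(32 − x) ≡ bx² (mod 32) used in this case can be verified by direct evaluation for coefficient triples satisfying a ≡ 4c, b ≡ 6c (mod 8) with c odd. The sketch below does this for a few sample triples; the triples and function names are assumptions chosen for the illustration.

```python
def P(x, a, b, c):
    """P(x) = a(x^3 - x)/6 + b(x^2 - x)/2 + cx; both divisions are exact."""
    return a * (x**3 - x) // 6 + b * (x * x - x) // 2 + c * x

def admissible(a, b, c):
    """The hypotheses of this section: c odd, a = 4c and b = 6c modulo 8."""
    return c % 2 == 1 and (a - 4 * c) % 8 == 0 and (b - 6 * c) % 8 == 0

def pair_sum_residue(x, a, b, c):
    # Residue of P(x) + P(32 - x) - b x^2 modulo 32; expected to be zero.
    return (P(x, a, b, c) + P(32 - x, a, b, c) - b * x * x) % 32

# Hypothetical sample triples (a, b, c) satisfying the hypotheses.
SAMPLE_TRIPLES = [(4, 6, 1), (12, -2, 1), (20, 2, 3)]
```

A zero residue for all x in a full period confirms the identity for the sampled triples.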


THE UNIVERSITY OF CALIFORNIA, 
BERKELEY, CALIFORNIA. 


⁵ E. Landau, Vorlesungen über Zahlentheorie, Bd. I, Theorem 301.



THE CONJUNCTIVE EQUIVALENCE OF PENCILS OF HERMITIAN 
AND ANTI-HERMITIAN MATRICES.* 


By John Williamson.


Let $K$ be a commutative field of characteristic zero and let $K(i)$ be a
quadratic adjunction field of $K$, where $i$ is a zero of the polynomial $x^2 - a$,
irreducible in $K$. If $A$ is a matrix with elements in $K(i)$, or more shortly
a matrix over $K(i)$, the matrix $A^*$ is defined to be the conjugate transposed
of $A$, so that $A^* = \bar{A}'$. In particular, if $A$ is a matrix over $K$, $A^* = A'$, the
transposed of $A$. Let

(1) $\Lambda = rA + sB,$


be a pencil of matrices, in which

(2) $A^* = \epsilon A, \quad B^* = \delta B, \quad \epsilon, \delta = \pm 1,$


so that $A$ is either hermitian or anti-hermitian and so is $B$. Let $\Lambda_1 = rA_1 + sB_1$
be another such pencil. Then the two pencils $\Lambda$ and $\Lambda_1$ are said to be
conjunctively equivalent, if there exists a non-singular matrix $P$ over $K(i)$
such that

$P\Lambda P^* = \Lambda_1;$


that is, if $PAP^* = A_1$ and $PBP^* = B_1$. When the matrices $A, B, A_1, B_1$ are
all matrices over $K$, the two pencils are said to be congruently equivalent,
if there exists a non-singular matrix $P$ over $K$, such that


$P\Lambda P' = \Lambda_1.$
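A conjunctive transformation keeps each matrix of the pencil hermitian or anti-hermitian, which is why the definition stays within the class of pencils satisfying (2). The sketch below illustrates this in the special case, assumed here, where K is the real field and i² = −1, using Python complex numbers with small integer entries so that the arithmetic is exact; the matrices and helper names are hypothetical.

```python
def conj_transpose(M):
    """Return M* = conjugate transpose of a matrix given as a list of rows."""
    return [[M[j][i].conjugate() for j in range(len(M))]
            for i in range(len(M[0]))]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def scale(k, M):
    return [[k * entry for entry in row] for row in M]

def symmetry_sign(M):
    """Return +1 if M* = M, -1 if M* = -M, and None otherwise."""
    star = conj_transpose(M)
    if star == M:
        return 1
    if star == scale(-1, M):
        return -1
    return None

# Hypothetical sample data: A hermitian, B anti-hermitian, P non-singular.
A = [[2 + 0j, 1 + 1j], [1 - 1j, 3 + 0j]]
B = [[1j, 2 + 0j], [-2 + 0j, -1j]]
P = [[1 + 0j, 1j], [0j, 2 + 0j]]

def conjunctive(P, M):
    """Return P M P*."""
    return matmul(matmul(P, M), conj_transpose(P))
```

The same helpers also show that multiplying an anti-hermitian matrix by i yields a hermitian one, which is the device that reduces a mixed pencil to a hermitian pencil.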


There are accordingly two distinct problems to be considered; (a) to
determine necessary and sufficient conditions that two pencils, which satisfy
(2), be conjunctively equivalent and (b) to determine necessary and sufficient
conditions that two such pencils be congruently equivalent. Problem (a) has
been solved completely, for the case in which $\epsilon = \delta = 1$, when $K$ is the field
of all real numbers¹ and also, under the restriction that $B$ be non-singular,
when $K$ is a commutative field of characteristic zero.² In the other two cases,


in which one or both of $\epsilon$, $\delta$ have the value $-1$, problem (a) may be reduced
to the case in which $\epsilon = \delta = 1$. For example, let $\epsilon = -1$, $\delta = 1$, so that
$A = -A^*$, $B = B^*$. Then, $(iA)^* = iA$ and

(3) $\Lambda = tC + sB,$

where $t = r/i$, $C = iA$, $C = C^*$, $B = B^*$ and (3) is a pencil of hermitian matrices.
Hence, when $\Lambda$ is a non-singular pencil, problem (a) is completely solved.

Problem (b) has been solved for the case, in which $\epsilon = \delta = 1$, when $K$
is the real field³ and also, under the restriction that $B$ be non-singular, when
$K$ is a general commutative field of characteristic zero, (I). It is however not
possible to reduce the other cases to this one and they must be considered
separately. The problem has also been solved for a non-singular pencil and
general field $K$, when $\epsilon = 1$, $\delta = -1$.⁴ The remaining problem when
$\epsilon = \delta = -1$, so that both $A$ and $B$ are skew symmetric matrices, is considered
here. It is shown that two pencils of skew symmetric matrices are congruently
equivalent, if they have the same Kronecker minimal indices and the same
invariant factors.⁵ This is a much simpler result than those obtained in the
other cases; but it is only natural that this be so, since two skew symmetric
matrices of the same rank are congruently equivalent, while the same is not
true of two symmetric matrices.

Since, when $A$ and $B$ are both skew symmetric matrices of odd order,
every matrix of the pencil $\Lambda$ is singular, a treatment of singular pencils is
absolutely necessary. Accordingly the conjunctive or congruent equivalence
of two general pencils $\Lambda$, which satisfy (2), is first considered, so that the
solution of both problems (a) and (b) is completed in all cases. The method
here adopted is quite distinct from that used in the discussion of non-singular
pencils of hermitian matrices,⁶ and has the advantage that at no stage is a
change made in the basis of the pencil.

Section 1 is devoted to the proofs of subsidiary lemmas, section 2 to the
consideration of singular pencils, section 3 to that of non-singular pencils in
which $B$ is singular, and 4 to the reduction of a non-singular pencil of skew
symmetric matrices to canonical form.


* Received November 30, 1936.

¹ H. W. Turnbull, "On the equivalence of pencils of hermitian forms," Proceedings
of the London Mathematical Society, vol. 39 (1935), pp. 232-248; M. H. Ingraham and
K. W. Wegner, "The equivalence of pairs of hermitian matrices," Transactions of the
American Mathematical Society, vol. 38 (1935), pp. 145-162. In both papers a
treatment of singular pencils is given.

² John Williamson, "The equivalence of non-singular pencils of matrices in an
arbitrary field," American Journal of Mathematics, vol. 57 (1935), pp. 475-490. This
paper will be referred to as I.

³ Turnbull, loc. cit.

⁴ John Williamson, "On the algebraic problem concerning the normal forms of
linear dynamical systems," American Journal of Mathematics, vol. 58 (1936), pp. 141-163.
This paper will be referred to as II.

⁵ This is a well known result in case K is algebraically closed. See L. E. Dickson,
Modern Algebraic Theories, p. 125, or C. C. MacDuffee, The Theory of Matrices, p. 61.

⁶ Turnbull, loc. cit.; Wegner and Ingraham, loc. cit.


1, If A=rA + sB, we define the matrix pencil A” by 
(4) A” = erA* + 8sB*. 


Let p; be the matrix pencil 


Or 0 0 

sas 
(0 0 0 r 


of j rows and j7-++-1 columns. Then p;” is a matrix pencil of 7 + 1 rows and 


j columns, while the matrix pencil 


(pi 9 
(6) 


is a square matrix of 27 + 1 rows and columns. Let 


where #; and U; are respectively the unit matrix and the auxiliary unit matrix 
of order j.7 We now prove a few elementary lemmas involving the matrices 


Rj and Nj. 
Lemma I. Let S and T be two matrices, which satisfy 


Then, if p < q, the first row of S is zero; if pq, 


(9) 
021 G22 


where oy. and on; are diagonal matrices of orders p+ 1 and p respectively. 


Proof. Since S and T satisfy (8) we may write S and T as two rowed 


square matrices of matrices, 


where o,, is a matrix of p + 1 rows and q columns, o;2 a matrix of p + 1 rows 
and g-+ 1 columns etc. From (8) we immediately deduce the four equations 


(10) 011Pq = Pp 7115 = Pp T12, = PpT21; T22Pq PpTe22- 


*See Turnbull and Aitken, Canonical Matrices, p. 62. 


12 


O11 O12 T= T11 T12 
O21 G22 T21 22 


The first of these equations is of the form, 
(11) Coq = pp D 


where C = (c;;) is a matrix of p+ 1 rows and q columns, while D = (dj;) 
is a matrix of p rows and g-+1 columns. As a consequence of (11) we have 


-f- SCi,j-1 = erdi; + (i= 1, 5, j=1, 2, 1), 


with the understanding that 


(12) do,j dps1,j = (). 
Therefore, 
(13) = 8di_s,j; €9Ci+1,j-15 


and we see from the last of these equations that the elements in any counter 
diagonal of C are the same except perhaps for sign. But, as a consequence 
of (12) and (13), 
= = 0 
and also 
Ci-1,1 = €8Ci,o = 0, 2,3,---,p-+1). 


Hence the first column and the last row of C are zero, and consequently C is 
zero. Therefore o;, is zero. 

The second of equations (10) is of the form Cpg” —p,)”D, where 
C = (cj) is a matrix of p+1 rows and g+1 columns, while D = (dij) 
is a matrix of p rows and q columns. Consequently 


= Ci, = Cig = jar, 


(t= 1, 2,- j =1,2,- ‘ 
so that the elements in any diagonal of C differ at most in sign. But 


C1,j41 = 8do,j = 0 and Cp+1,j = 0. 
Hence 
= 0, +0) 
Cp+1,5 = 0, (j = 1, j 


Therefore, if the number of rows of C is less than the number of columns, 
i.e. if p< q, C=0 and, if p—q, C is a diagonal matrix. Hence, if p< 4 
= 0 and, if p= is a diagonal matrix. Moreover, if p= 4q, D is also 
a diagonal matrix, whose first element is e times the first element of C. We 
have as a result the corollary: 


Corotuary I. If T = 8* and is non-singular, 18 non-singular. 




The proofs of the following lemmas, which are similar to the above, are 
omitted. 

LemMa 2. Jf SN; = R,’T, the first row of S 1s zero. 

Lemma 3. If M isa square matrix of order t and + = 
the first row of S 1s zero. 

Lemma 4. If SN; =WN;"T andi < the first row of S 1s zero. 

LemMA 5. If S(rM+ sH) =N,T, the first row of S is zero. 

When G is a square matrix of order n, we may consider G as a matrix
of matrices and write

$$G = (G_{ij}), \quad (i, j = 1, 2, \cdots, t),$$

where $G_{ij}$ is a matrix of $n_i$ rows and $n_j$ columns. If H is a second n-rowed
square matrix

$$H = (H_{ij}), \quad (i, j = 1, 2, \cdots, t),$$

where $H_{ij}$ is also a matrix of $n_i$ rows and $n_j$ columns, we shall say that G and
H are similarly partitioned. If $G_{ij} = 0$ when $i \neq j$, we shall call G a diagonal
block matrix and write

$$G = [G_{11}, G_{22}, \cdots, G_{tt}].$$


2. Let A be the pencil defined by (1) and (2) so that 
(14) A A”, 


Let A annihilate the column vector u of dimension n, whose elements are
polynomials in r and s with coefficients in K(i). Then

Au = 0, identically in r and s,
and consequently
0 = u''A, by (14).

Since u'' is the row vector obtained from u* by replacing r by εr and s by δs,
the degree of u'' in r and s is the same as the degree of u. Therefore the set of
Kronecker minimal row indices⁸ coincides with the set of minimal column
indices. In particular, if $u_1, u_2, \cdots, u_p$ form a complete set of linearly
independent vectors over K(i) annihilated by A and, if V is any non-singular
matrix, whose first p columns are the vectors $u_1, u_2, \cdots, u_p$, then

(15) $V^*AV = [0, A_1]$,


⁸ Turnbull and Aitken, op. cit., pp. 119-125.


404 JOHN WILLIAMSON. 


where $A_1$ is a pencil of the same type as A but of order n − p. Moreover none
of the minimal row (or column) indices of $A_1$ is zero. Since (15) is a
conjunctive transformation and since, when A and B are matrices over K, V* = V',
we may consider the pencil $A_1$ instead of A. Accordingly without any loss
of generality we may assume that no minimal row or column index of A is zero.

Let Π be any pencil equivalent to A in the more general sense that there
exist two non-singular matrices Q and P, such that

$$QAP = \Pi.$$

Then,
$$P^*AP = P^*Q^{-1}\Pi = H\Pi,$$

so that the pencil HΠ is conjunctively equivalent to A. Accordingly, as a
consequence of (14), 
(16) = H*. 

We now prove 


LEMMA 6. Let $\Pi = [\Pi_1, \Pi_2]$ be a diagonal block matrix, where $\Pi_i$ is
of order $n_i$, and let the matrix $H_1$, formed from the first $n_1$ rows and columns
of H, be non-singular. Then, if equation (16) is satisfied, there exists a
non-singular matrix W such that $WH\Pi W^*$
is also a diagonal block matrix.


Proof. Let $H = (H_{ij})$, i, j = 1, 2, be a partition of H similar to that
of Π. Then $H_{11} = H_1$ is non-singular.
Equation (16) implies 


A, = 0,” A* ji, (i, =1, 2), 
and consequently 


It now follows by a simple calculation that, if 


0 


where $E_i$ is the unit matrix of order $n_i$,
$WH\Pi W^* = [H_1\Pi_1, H_2\Pi_2]$.


Since the matrix W depends solely on H we have the 


COROLLARY. If H is a matrix over K, W is a matrix over K and W* = W'.
We now take Π in the canonical form⁹
(17) $\Pi = [R_{j_1}, R_{j_2}, \cdots, R_{j_k}, N_{t_1}, \cdots, N_{t_m}, rM + sE]$.


In (17) each matrix $R_{j_i}$ is defined by (6) and $j_1, j_2, \cdots, j_k$ form a complete
set of minimal row (or column) indices, each matrix $N_{t_i}$ is defined by (7)
and corresponds to an elementary factor $r^{t_i}$ of A; and rM + sE is a pencil
whose second member is non-singular and can therefore be taken in this
simplified form.

For convenience we write (17) in the form


(18) $\Pi = [\pi_1, \pi_2, \cdots, \pi_{k+m+1}]$,


where $\pi_i = R_{j_i}$, $i \le k$; $\pi_{k+i} = N_{t_i}$, $i \le m$; $\pi_{k+m+1} = rM + sE$.
If $H = (H_{pq})$, $p, q = 1, 2, \cdots, k+m+1$, is a partition of H similar
to that of Π in (18), it follows from (16) that


(19) = H* gp. 


Let the integers j;, j2,: * *, jx in (17) be so arranged that 


If $H_{11}$ is singular and, for some value of $p \le c$, $H_{pp}$ is non-singular,
by a conjunctive transformation involving an interchange of rows and the
same interchange of columns, we may interchange $H_{11}$ and $H_{pp}$ without
disturbing Π. By (19) and Lemmas (1), (2) and (3), the first row of $H_{pq}$ is
zero when $p \le c$ and $q > c$, and contains at most one element, in the
$(j_1 + 1)$-th place, different from zero when $p \le c$ and $q \le c$. Further, by
the corollary to Lemma (1), $h_{pp}$ is zero, if, and only if, $H_{pp}$ is singular.
Therefore, if $H_{11}$ is singular, $h_{11} = 0$, and since H is non-singular at least
one element in the first row of H is different from zero. Accordingly, for
at least one value of $p \le c$, $h_{1p} \ne 0$. Without any loss of generality, then,
we may assume that the first row of $H_{12}$ contains the element $h_{12}$ distinct
from zero.

Let E be the unit matrix of order $2j + 1 = 2j_1 + 1$ and let


EE E 
20 = 


⁹ W. Ledermann, “Reduction of singular pencils of matrices,” Proceedings of the
Edinburgh Mathematical Society, vol. 4 (1935), series 2, pp. 92-105.




Then 

Ws (Rj, Rj] = (Gog) Bj], 

W2( Hp) [R;, Ry |W*, = (F'pq) R;], (p, q = 1,2), 
where 


Gis Ay, + Hy. + Ho, + Fy, = + (He, — — 


Let $g_{11}$ and $f_{11}$ be the elements in the first row and (j + 1)-th column of
the matrices $G_{11}$ and $F_{11}$ respectively. Then
(21) = hie + he, + has + hee, fu hie) + hay — Phoo. 

If $H_{11}$ and $H_{22}$ are both singular, $h_{11} = h_{22} = 0$ and at least one of $g_{11}$ or $f_{11}$
is non-zero, as otherwise $h_{12}$ would be zero. Therefore at least one of $G_{11}$ or
$F_{11}$ is non-singular.

If, however, we are restricted to congruent transformations over the 
field K, we are not at liberty to use the transformation W.. But, if 
Q = Lj, 

R;\Q [— [ Rj, Rj], 
and 
Q (Hoa) ]Q’ = Bi], (p,q = 1,2), 
where 
Ky. = [— Ey Ko = H,,[— 


Accordingly, if W. == W,Q, equation (21) becomes 
== hie + hes, fis = ho, — hi, 


so that once again either $G_{11}$ or $F_{11}$ is non-singular. Therefore, we may
suppose that the necessary transformation has already been made and that
$H_{11}$ is non-singular. As a consequence of Lemma (6), there exists a
non-singular matrix $P_1$, such that


P,AWP*, = 
By repeating this process k times we prove the existence of a non-singular 
matrix P such that 
where = [Ni,, +, Ni,,rM + sE]. All matrices H; in (22) are 
non-singular and satisfy the equations 


To simplify equations (23) we write Hi = S, Rj, —=R,, and with the 
notation of Lemma 1, have 




$$S = \begin{pmatrix} 0 & \sigma_{12} \\ \sigma_{21} & \sigma_{22} \end{pmatrix},$$
where $\sigma_{12}$ and $\sigma_{21}$ are non-singular and satisfy


(24) Cispp = pp 22. 


Let W be the matrix 


W ( 0 ) 
40220127 Ey 
Then, 
R,W* — ) )*o Pp ) by (24) 
Pp 'p 
2 992 
4p+1 
and 


WSR,W* = ( 


Therefore each matrix H; in (22) may be reduced to the form J;,, where 


0 
(25) 


Accordingly we have proved, 

THEOREM 1. There exists a non-singular matrix P such that
where I; is defined by (25) and Rj by (6), while the pencil Hy: is
non-singular.

COROLLARY 1. If the elements of A and B lie in K, the matrix P lies
in K and P* = P'.

COROLLARY 2. The canonical form on the right of (26) is determined
completely by the minimal indices of A and the non-singular core Hy.

3. In considering the reduction of the non-singular case we could, by a 
change of the basis of the pencil, reduce it to one in which the second member 
is non-singular. As a change of basis is quite distinct in nature from the 
conjunctive transformations of the pencil it is more satisfactory to proceed as 


follows.¹⁰ For simplicity we write


and 
(28) Hin = H = (Hy), (i, 7 =1,2,--°,m+1), 


¹⁰ Cf. Ledermann, loc. cit.




where (28) is a partition of H similar to that of Π in (27). If

(29) $t_1 = t_2 = \cdots = t_c > t_{c+1} \ge \cdots \ge t_m$,

it follows from Lemmas (4) and (5) that the first row of $H_{1p}$ = 0, when
p > c, and that at most one element in the first row of $H_{1p}$ is different from
zero, when p ≤ c. If $H_{11}$ is singular, we may suppose, as in the previous
section, that $H_{12}$ is non-singular and, since $H_{12} = \epsilon H^*_{21}$, that

$$\begin{pmatrix} H_{11} & H_{12} \\ H_{21} & H_{22} \end{pmatrix}$$

is non-singular. Therefore by several applications of Lemma 6 we may reduce
the pencil HΠ to a diagonal block form where each block is either of type (α)
$H_{11}N_t$ or of type (β)


It is now necessary to consider three distinct cases:
(1) ε = δ = 1, (2) ε = 1, δ = −1, (3) ε = δ = −1.


Case 1. Both matrices in the pencil are hermitian or both symmetric. 
A block of type (β) may be reduced to two of type (α) (I page 481). The
matrix $H_{11}$ in (α) is of the form,


Ay, =T + +: + 


where $T_t$ is the counter unit matrix of order t.
Finally $H_{11}$ may be reduced to the form


(30) Ay
where $g_1$ lies in K.
In (30) $g_1$ is not uniquely determined. In fact, the diagonal matrix
$G = [g_1, g_2, \cdots, g_c]$, where c is defined by (29), may be replaced by the
diagonal matrix $F = [f_1, f_2, \cdots, f_c]$, provided F and G are conjunctively
equivalent (I pages 482-487).


THEOREM 2. If $r^t$ occurs exactly c times among the elementary factors
of a pencil of hermitian matrices A, in the canonical form for A there is a block
GcT't]. The diagonal matrix $[g_1, g_2, \cdots, g_c]$ is determined
apart from a conjunctive transformation.

Theorems 1 and 2 together with the theorem in I, page 487, give a com- 
plete solution of problem (a). 


Case 2. As stated in the introduction, it is only necessary to consider
congruent transformations of pencils with elements in K. The matrix $H_{11}$ is
symmetric, while the matrix $H_{11}U_t$ is skew symmetric. Therefore


(31) $H_{11}U = -U'H'_{11} = -U'H_{11}$, where $U = U_t$.




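Let
$$X = \begin{pmatrix} 0 & \cdots & 0 & 1 \\ 0 & \cdots & -1 & 0 \\ \vdots & & & \vdots \\ (-1)^{t-1} & \cdots & 0 & 0 \end{pmatrix}.$$
Then XU = −U'X, and as a consequence of (31) $H_{11} = XG$, where G is
commutative with U. Hence
(32) $H_{11} = Xf(U)$,


where f(U) is a polynomial in U, with coefficients in K.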
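The relation XU = −U'X can be checked mechanically for small orders. The following Python sketch is an illustration only and not part of the paper: it assumes that U carries its units on the first superdiagonal and that X has the counter-diagonal entries 1, −1, ..., (−1)^(t−1) read from the first row downward. All function names are hypothetical.

```python
def auxiliary_unit(t):
    """U (assumed form): ones on the first superdiagonal, zeros elsewhere."""
    return [[1 if j == i + 1 else 0 for j in range(t)] for i in range(t)]


def alternating_counter_diagonal(t):
    """X (assumed form): counter-diagonal entries 1, -1, ..., (-1)**(t-1)."""
    return [[(-1) ** i if j == t - 1 - i else 0 for j in range(t)]
            for i in range(t)]


def transpose(a):
    return [list(row) for row in zip(*a)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def negate(a):
    return [[-x for x in row] for row in a]


def satisfies_relation(t):
    """True when X U == -U' X for the matrices of order t built above."""
    u = auxiliary_unit(t)
    x = alternating_counter_diagonal(t)
    return matmul(x, u) == negate(matmul(transpose(u), x))


def is_symmetric(a):
    return a == transpose(a)


if __name__ == "__main__":
    for t in range(1, 7):
        x = alternating_counter_diagonal(t)
        print(t, satisfies_relation(t), is_symmetric(x))
```

Under these assumptions X is symmetric exactly when t is odd and skew symmetric when t is even, which agrees with the separate treatment of odd and even t in the text.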


Hence, if t is odd,
f(T) 7) — gO"), 
while, if t is even,


Let t be odd. The congruent transformation by the matrix W of (20)
reduces a block of type (β) to one in which $H_{11}$ is non-singular. Therefore
we need only consider blocks of type (α). Let the matrix f(U) in (32) be


written as 


f(u) =gE + y, where y = y(U*?) = U*y,(U?). 


Then, if W = — 


X (HE — tyg*) (g# + 

—X (gh +7°4)N, 
where ¢ is a polynomial in U*. Since y? contains a factor U*, it is possible 
by a succession of such transformations to reduce H,, to the form XgF. The 


WXf(U)NW 


element g is not unique. By an argument similar to that of II, pages 152-154, 


we have 


THEOREM 3. If t is odd and $r^t$ occurs c times among the elementary
factors of the pencil A, in the canonical form of A occurs a block matrix
+, where the diagonal matrix $[g_1, g_2, \cdots, g_c]$


is determined apart from a congruent transformation.


Let t be even. Then $H_{11}$ is singular and only blocks of type (β) may


occur. It is easily shown that




where ’) , N] = [N”, and 


N] = [N”, 
Therefore, 


(E— 347") (vy + 4) LN, N] — 3677)’ = — dy") (y + 6) — 


= (y— + toy "by"¢) [N, 
= (v1 + $1) LY, 

where 


and X U* 922(U ) 


Since γ is non-singular, $\gamma_1$ is non-singular, and we may repeat this process
with γ replaced by $\gamma_1$ and φ by $\phi_1$. After t/2 repetitions the matrix $\phi_{t/2}$ is
the zero matrix, since $U^t = 0$. Hence a block of type (β) may be reduced
to the form


(33) 0 N 0 


H,, 0 
where H,.N Tf’... But, 


E 0 0 Hiw\fN 0 
(34) 0 le | 
-(° Hi 0 N 


Therefore we have proved, 


THEOREM 4. If t is even and $r^t$ occurs c times among the elementary
factors of A, then c is even. Corresponding to each pair of elementary factors


$r^t$, $r^t$, there is a block matrix tig . in the canonical form of A.


0 


Theorems 1, 3 and 4 complete the solution of problem (b) when ε = 1,
δ = −1.


Case 3. The matrices $H_{11}$ and $H_{11}U$ are both skew symmetric so that
$H_{11}U = U'H_{11}$. If T is the counter unit matrix of order t,

TU = U'T and $H_{11} = Tf(U)$. Since $H_{11} = -H'_{11}$,

$Tf(U) = -f(U')T = -Tf(U)$, so that f(U) = 0 and $H_{11} = 0$.


0 
Therefore we need only consider blocks of type ) 
21 


0 N. 




which is of the same nature as (33) and can be reduced to

$$\begin{pmatrix} 0 & N'' \\ N & 0 \end{pmatrix}$$

by (34). Therefore,


THEOREM 5. If $r^t$ occurs c times among the elementary factors of a skew
symmetric pencil A, c is even. Corresponding to each pair of elementary
factors $r^t$, $r^t$ there is a block matrix $\begin{pmatrix} 0 & N'' \\ N & 0 \end{pmatrix}$ in the canonical form of A.


3. In sections 1 and 2 we have proved that there exists in all three cases
a non-singular matrix P such that


$P^*AP = [A_1, A_0]$,
where the form of $A_1$ is determined and
(35) $A_0 = H(rM + sE)$,


the matrix H being non-singular.

As mentioned in the introduction a normal form for $A_0$ has been determined
in case 1 (I) and also in case 2 (II). We now consider case 3. The matrices
H and HM are both skew symmetric, so that HM = M'H and, consequently,
if Q is any non-singular matrix satisfying the equation 


(36) QM = M’Q, 
then H = QG, where GM = MG. 

Let the elementary factors of rM + sE be the homogeneous polynomials
(37) pi(r, (t= 1,2,---,¢; j= 1,2,: 


where pi (r,s) is irreducible in K[r, s]. 
We may take M in the canonical form 


M =[M,, -, Mz), 


where the elementary factors of rM;,-+ sH, are the polynomials (37), when 
i=k. Since there exists a non-singular matrix Q, such that QM; = M.Qx 
(I page 478), the matrix 9 = [Q;, Qo,: - -,Q+] is non-singular and satisfies 
(86). Any matrix G commutative with M is also a diagonal block matrix 
- -, and consequently 


H = OG = [Q:G4, 


Therefore it is sufficient to consider a pencil $A_0$ in (35), whose elementary
factors are all powers of the same irreducible polynomial p(r, s), i.e. are


¹¹ John Williamson, “The idempotent and nilpotent elements of a matrix,” American
Journal of Mathematics, vol. 58 (1936), pp. 747-758.




Accordingly we take M in the canonical form 


(38) $M = [L_1, L_2, \cdots, L_t]$,
where

(39) $L_i = p \cdot E_i + e \cdot U_i$, and write
(40) $N_i = rL_i + s\,e \cdot E_i$.


In (39), $E_i$ and $U_i$ are respectively the unit matrix and the auxiliary unit
matrix of order $n_i$; p is the companion matrix of p(1, λ); e the unit matrix
of the same order as p; and · denotes direct product (I page 477). For
example, if $n_i = 3$,

$$L_i = \begin{pmatrix} p & e & 0 \\ 0 & p & e \\ 0 & 0 & p \end{pmatrix}.$$


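It has been shown, I page 490, that there exists a non-singular symmetric
matrix q such that qp = p'q. Hence, if $T_i$ is the counter unit matrix of
order $n_i$,

$$(q \cdot T_i)L_i = (q \cdot T_i)(p \cdot E_i + e \cdot U_i) = (p' \cdot E_i + e \cdot U_i')(q \cdot T_i) = L_i'(q \cdot T_i).$$

Therefore the matrix

$$Q = [q \cdot T_1, q \cdot T_2, \cdots, q \cdot T_t]$$

satisfies (36), and is symmetric.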
Let
$G = (G_{ij})$, $(i, j = 1, 2, \cdots, t)$,


be a partition of a matrix G similar to that of M in (38). If G is commutative
with M, the form of G is known.¹² In fact, if $n_i \ge n_j$,


(41) $G_{ij} = \begin{pmatrix} S_{ij} \\ 0 \end{pmatrix}$, $G_{ji} = (0, S_{ji})$,


where $S_{ij}$ and $S_{ji}$ are square matrices of order $n_j$, while 0 is the zero
matrix of $n_i - n_j$ rows and $n_j$ columns. Further


9-1 9-1 
(42) = D 844 = J 3%, 
a=0 a-0 


where $s_{ij\alpha} = s_{ij\alpha}(p)$ is a polynomial in the matrix p.
Since H = QG is skew symmetric,


$QG = -G'Q' = -G'Q$.


¹² John Williamson, The Idempotent and Nilpotent Elements of a Matrix, p. 457.




Hence g: = — @’jiq: Tj, or, when 1S j, 


te 
q T; q Tj. 


From this we deduce 
But, by direct calculation from (42), 


and hence 
(43) Sij = — 


In particular $G_{ii} = S_{ii} = 0$. Let $n_1 = n_2 = \cdots = n_c > n_{c+1}$. Since G
is non-singular and, when j > c, the first column of $G_{j1}$ is zero, $G_{i1}$ must be
non-singular for at least one value of i, $2 \le i \le c$. We may therefore assume
that $G_{21}$ is non-singular and, as a consequence of (43), that


= ( 0 
Goi Goo Goi 0 


is non-singular. Therefore by Lemma 6 there exists a non-singular matrix P 


such that 
0 0 ) ] 
P [ 0 ) ( 0 N, HAs 


Since the block ie. 0 )( 0 he is of the same type as (33), it may be 


reduced to the block matrix } 


We have therefore proved


THEOREM 6. Each elementary factor $p(r, s)^n$ of a pencil of skew symmetric
matrices must occur an even number of times. Corresponding to each
pair of elementary factors $p(r, s)^n$, $p(r, s)^n$, in the canonical form is a matrix
block $\begin{pmatrix} 0 & N'' \\ N & 0 \end{pmatrix}$, where N is defined by (39) and (40).


Combining this with the results of sections (1) and (2) we have finally, 


THEOREM 7. Necessary and sufficient conditions, that two pencils of skew
symmetric matrices be equivalent under a non-singular congruent transformation
in K, are that the two pencils have the same Kronecker minimal row
indices and the same elementary factors.


THE JOHNS HOPKINS UNIVERSITY. 




SOME REMARKS ON CLASS FIELD THEORY OVER INFINITE 
FIELDS OF ALGEBRAIC NUMBERS.* 


By O. F. G. ScHILLING. 


Mr. M. Moriya recently investigated the theory of finite abelian extensions
over infinite fields of algebraic numbers. He has shown¹ that under certain
restricting conditions on the infinite algebraic ground field there exists an
analog to the classical class field theory: the finite abelian extensions of an
infinite field can be characterized by class groups of ideals in the ground field.
His results can be completed in several directions. In this note we shall
characterize the finite algebraic number fields by an intrinsic property of the
given field: a number field is finite if and only if there exists only a finite number
of cyclic superfields of some prime degree with a given defining modulus.
Furthermore we shall prove the analog to the theorem on arithmetic progressions
for a certain class of infinite fields. Finally we discuss the norm theorem
of Hilbert and Hasse. This connects our investigation with A. A. Albert's
results on algebras over infinite number fields.²

Let k be an infinite field of algebraic numbers over the field P of all
rational numbers. The field k can always be approximated by an enumerable
tower of finite algebraic number fields $k_i$ over P; that is to say k is the join
$\sum k_i$ of finite fields $k_i$ such that

$$P \subseteq k_0 \subset k_1 \subset \cdots \subset k_i \subset k_{i+1} \subset \cdots \subset \sum k_i = k.$$

With the field k there is associated a Steinitz G-number N(k, P), the absolute
G-degree of k. The number N(k, P) is defined as the formal least common
multiple of all the relative degrees [h : P] where h is any finite subfield of k.
If a prime p divides almost all degrees [h : P] (that it divide almost all degrees
$[k_i : k_{i-1}]$ is sufficient too) then we say that $p^\infty$ divides N(k, P). Thus
N(k, P) can be uniquely decomposed into an infinite part $N_\infty(k, P)$ consisting
of the product of all $p^\infty \mid N(k, P)$, and a finite part $N_{\mathrm{fin}}(k, P)$ which consists
of the exact powers of those primes p which divide only a finite number of

* Received December 18, 1936. 

¹ M. Moriya, “Klassenkörpertheorie für einen unendlichen Zahlkörper.” Will
appear in the Journal of the Faculty of Science, Sapporo (Japan).

² A. A. Albert, “Normal division algebras over algebraic number fields not of
finite degree,” Bulletin of the American Mathematical Society, October, 1933.




relative degrees [h : P]. Moriya has shown that exactly all those abelian
extensions K of k whose degrees [K : k] = n are prime to $N_\infty(k, P)$ are class
fields, that is to say the Galois groups G(K, k) of K over k are isomorphic with
class groups $\mathfrak{a}/H(K, k)$ derived from the group $\mathfrak{a}$ of all ideals in k that have
inverses.³
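The bookkeeping behind the G-degree can be illustrated by a short Python sketch. It is an illustration under stated assumptions, not a construction from the paper: an infinite tower cannot be enumerated, so the primes that occur to unbounded powers are supplied by hand in the hypothetical argument `unbounded_primes`, and only finitely many sample degrees are factored. All names are hypothetical.

```python
import math


def factorize(n):
    """Prime factorization of a positive integer as {prime: exponent}."""
    factors = {}
    d = 2
    while d * d <= n:
        while n % d == 0:
            factors[d] = factors.get(d, 0) + 1
            n //= d
        d += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def formal_lcm(degrees, unbounded_primes=()):
    """Formal least common multiple as {prime: exponent or math.inf}.

    `degrees` are sample degrees [h : P]; `unbounded_primes` lists the primes
    assumed to occur to arbitrarily high powers along the whole tower.
    """
    g = {}
    for n in degrees:
        for p, e in factorize(n).items():
            g[p] = max(g.get(p, 0), e)
    for p in unbounded_primes:
        g[p] = math.inf
    return g


def split_parts(g):
    """Split a G-number into its infinite part (a set of primes) and finite part."""
    infinite = {p for p, e in g.items() if e == math.inf}
    finite = {p: e for p, e in g.items() if e != math.inf}
    return infinite, finite


def is_prime_to_infinite_part(n, g):
    """True when the ordinary integer n shares no prime with the infinite part."""
    infinite, _ = split_parts(g)
    return all(n % p != 0 for p in infinite)


if __name__ == "__main__":
    g = formal_lcm([3, 6, 12], unbounded_primes=(2,))
    print(split_parts(g))
    print(is_prime_to_infinite_part(9, g), is_prime_to_infinite_part(6, g))
```

In this toy example the infinite part consists of the prime 2 alone, so a degree n = 9 is prime to it while n = 6 is not; this is the kind of condition imposed on [K : k] in the statement quoted from Moriya.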

It is not difficult to construct infinite fields k whose infinite parts
$N_\infty(k, P)$ of the respective G-degrees N(k, P) are equal to one. Such fields
k have then the property that all abelian fields K over k are class fields. There
arises the problem of finding properties of these infinite fields k that distinguish
them from the finite algebraic number fields.

Now let $k_0$ be a finite algebraic number field which contains the l-th roots
of unity $\zeta_l$ ($l \ne 2$). By $\mathfrak{f}$ we denote an integral ideal of $k_0$ whose prime
divisors we shall later specify, and by ∞ the product over all the infinite prime
places of $k_0$. Assume that $\mathfrak{f}$ is chosen in such a fashion that there exist cyclic
extensions Z of $k_0$ whose degrees are equal to l and whose conductors $\mathfrak{f}(Z, k_0)$
are divisors of $\mathfrak{f}\infty$. Such moduli $\mathfrak{f}\infty$ always exist. The number of different
cyclic fields Z of this type shall be denoted by $R(k_0, l, \mathfrak{f}\infty)$.


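Let the prime divisors of l in $k_0$ be $\mathfrak{l}_1, \cdots, \mathfrak{l}_s$; we obtain a decomposition

(1) $l = \prod_{i=1}^{s} \mathfrak{l}_i^{e(i)}$, and have $N(k_0, \mathfrak{l}_i) = l^{f(i)}$ where $\sum_{i=1}^{s} e(i)f(i) = [k_0 : P]$.

Then the numbers $w(k_0, \mathfrak{l}_i) = e(i)\,l\,(l-1)^{-1}$ are always integral.⁵ Suppose
now that the ideal $\mathfrak{f} = \prod \mathfrak{p} \prod_{i=1}^{s} \mathfrak{l}_i^{v(i)}$ is chosen in such a way that the
exponents v(i) are greater than or equal to the numbers $w(k_0, \mathfrak{l}_i)$ and that there
occur sufficiently many prime ideals $\mathfrak{p}$. Then there surely exist cyclic fields Z
of degree l over $k_0$ whose conductors $\mathfrak{f}(Z, k_0)$ divide $\mathfrak{f}\infty$.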


LEMMA 1. If $k_0'$ is a finite extension of $k_0$ whose relative degree n' is
prime to l, then

$$R(k_0', l, \mathfrak{f}\infty) > R(k_0, l, \mathfrak{f}\infty).$$


Proof. First we give an explicit expression for the number $R(k_0, l, \mathfrak{f}\infty)$.
By $w(\mathfrak{p})$ and $w(\mathfrak{l})$ we denote the number of prime divisors $\mathfrak{p}$ and $\mathfrak{l}$ which
divide $\mathfrak{f}$. The number of fundamental basic units of $k_0$ is equal to
$r(k_0) = [k_0 : P]\,2^{-1} - 1$ because there exist no real infinite prime places in $k_0$
and the number of complex infinite prime places is equal to $[k_0 : P]\,2^{-1}$ as the


following inclusion shows


³ For details see the paper of M. Moriya mentioned under no. 1.

⁴ For the class field theory over finite number fields see the Tract of H. Hasse in
vol. 35 (1930) of the Jahresberichte der Deutschen Mathematikervereinigung.

⁵ The following formulae can be found in Hasse's Tract, part Ia, § 15.

⁶ Cf. Hasse's Tract, part II.


$$P \subset P(\zeta_l) \subseteq k_0.$$


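Now let $\{\alpha\}$ be the multiplicative group of all numbers α in $k_0$ which are
prime to $\mathfrak{f}$. The group $\{\alpha\}$ contains the subgroup $\{\omega\}$ of all numbers ω in $k_0$
for which $\omega \equiv \alpha^l \pmod{\mathfrak{f}}$ and $(\omega) = \mathfrak{r}^l$ where $\mathfrak{r}$ are non-principal ideals of $k_0$
which are relatively prime to $\mathfrak{f}$. (Written as $\mathfrak{r} \nsim 1\ (k_0)$.) Then the index
$[\{\alpha\} : \{\omega\}]$ is equal to the $m(k_0)$-th power of l. Finally $R(k_0, l, \mathfrak{f}\infty)$ becomes
equal to $(l^S - 1)(l - 1)^{-1}$ where


$$S = w(\mathfrak{p}) + w(\mathfrak{l}) + m(k_0) + \sum e(i)f(i) - (r(k_0) + 1)$$
$$\;\; = w(\mathfrak{p}) + w(\mathfrak{l}) + m(k_0) + [k_0 : P]\,2^{-1}.$$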
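The expression $(l^S - 1)(l - 1)^{-1}$ is the familiar number of subgroups of index l in an elementary abelian l-group of rank S. Reading S as such a rank is an interpretation offered here for orientation; the paper states only the formula. The Python sketch below, with hypothetical function names, verifies the count by brute force for small prime l and small S, enumerating non-zero linear forms on $(\mathbb{Z}/l)^S$ up to scalar multiples (each class has a distinct kernel of index l). It requires Python 3.8 or later for the modular inverse via `pow`.

```python
from itertools import product


def count_index_l_subgroups(l, s):
    """Count index-l subgroups of (Z/l)^s, l prime, by brute force.

    Each such subgroup is the kernel of a non-zero linear form, and two forms
    have the same kernel exactly when they differ by a non-zero scalar.
    """
    normalized_forms = set()
    for form in product(range(l), repeat=s):
        if not any(form):
            continue  # the zero form has the whole group as kernel
        pivot = next(c for c in form if c)
        inverse = pow(pivot, -1, l)  # modular inverse, valid because l is prime
        normalized_forms.add(tuple((c * inverse) % l for c in form))
    return len(normalized_forms)


def closed_form(l, s):
    """The value (l**s - 1) / (l - 1) appearing in the text."""
    return (l ** s - 1) // (l - 1)


if __name__ == "__main__":
    for l in (3, 5):
        for s in (1, 2, 3):
            print(l, s, count_index_l_subgroups(l, s), closed_form(l, s))
```

Since the closed form increases strictly with S, an inequality S' > S between two such exponents gives the corresponding strict inequality between the counts.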
Now we wish to calculate the expression 9’ in k’ with respect to the same 
modulus fo. The prime ideal p may be decomposed as p = "Th ve ’ where 
j= 


g(..) =g(); then w(p’) = w(p) and the exponents e(7) can be normalized 
to one because they do not matter in the determination of S’. For the 


gti) 
prime ideals we obtain similarly and goes over into 
j=1 


IL . As a simple calculation shows, we have 

v(t)e(t,7) = Vis) = e(t, j) w(keo, ; 
hence w(f’) = w(f). 

Moreover, $m(k_0') = m(k_0)$. This is seen as follows. Evidently it is
sufficient to show that an ideal $\mathfrak{r} \nsim 1\ (k_0)$ cannot become a principal ideal in $k_0'$.
Assume that $\mathfrak{r} \sim 1\ (k_0')$; then $N(k_0', k_0 \mid \mathfrak{r}) = \mathfrak{r}^{n'} \sim 1\ (k_0)$. According to our
assumptions, $(n', l) = 1$; hence there exist two integers c and d such that
$cn' + dl = 1$. This leads to the relation $\mathfrak{r} = \mathfrak{r}^{cn' + dl} \sim 1\ (k_0)$,
for $\mathfrak{r}^l \sim 1\ (k_0)$. But this is in contradiction to the assumptions on the ideal $\mathfrak{r}$.
Thus we obtain

+ + m(Ho) + 2 
=w(p’) + + + P] 27, 


so that certainly S' > S for n' > 1. Thus we have
$$R(k_0', l, \mathfrak{f}\infty) > R(k_0, l, \mathfrak{f}\infty).$$
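The integers c and d with cn' + dl = 1 used in the proof can be produced by the extended Euclidean algorithm. The Python sketch below is a standard illustration of that step, not taken from the paper; the function name `bezout` and the sample values of n' and l are hypothetical.

```python
def bezout(a, b):
    """Return (g, c, d) with c*a + d*b == g, where g = gcd(a, b)."""
    old_r, r = a, b
    old_c, c = 1, 0
    old_d, d = 0, 1
    while r:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_c, c = c, old_c - q * c
        old_d, d = d, old_d - q * d
    return old_r, old_c, old_d


if __name__ == "__main__":
    n_prime, l = 4, 3  # sample relative degree and prime, chosen for illustration
    g, c, d = bezout(n_prime, l)
    print(g, c, d, c * n_prime + d * l)
```

With such c and d, any ideal whose n'-th power and l-th power are both principal is itself principal, which is the contradiction drawn in the proof.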


Now let k be an infinite algebraic extension of $k_0$ such that $N_\infty(k, P)$
is prime to l.


LEMMA 2. The number of cyclic extensions Z of degree l over k whose
conductors are divisors of $\mathfrak{f}\infty$ is infinite.




Proof. Each algebraic extension K = k(θ) of k is evidently the join of k
and a finite field $K_0$. For take some finite subfield $k_0$ of k such that the
coefficients belonging to the irreducible equation of θ in k lie in it; then $k_0(\theta)$
is a field with the asserted property. Thus all cyclic fields Z of degree l are joins
of finite cyclic fields $Z_i$ over $k_i \subset k$ with k. Now $R(k_i, l, \mathfrak{f}\infty) > R(k_{i-1}, l, \mathfrak{f}\infty)$
for sufficiently high i, because $([k_i : k_{i-1}], l) = 1$ as a consequence of
$(N_\infty(k, P), l) = 1$. Thus there exist infinitely many cyclic fields Z with the
described property.


THEOREM 1. If k is an algebraic number field such that for a prime
$l \ne 2$ the class field theory holds, then k is infinite if and only if there exist
infinitely many cyclic superfields Z of degree l over k whose conductors are
divisors of a given modulus $\mathfrak{f}\infty$ lying in a finite subfield of k.


Proof. Without loss of generality we can assume that k contains the l-th
roots of unity, because $k(\zeta_l)$ is at most of degree l − 1 over k. Therefore the
assumptions on k are carried over to $k(\zeta_l)$, $N_\infty(k(\zeta_l), P) = N_\infty(k, P)$.

Now if k is infinite we denote by $k_0$ some finite subfield of k such that
it contains the l-th roots of unity and that $N(k, k_0)$ is prime to l. The
modulus $\mathfrak{f}\infty$ shall be chosen as a modulus of the type investigated before; then
from the fact that for l the class field theory holds (that is to say, all cyclic
fields of degree l over k can be described uniquely by class groups of ideals
in k) it follows that $(N_\infty(k, P), l) = 1$. Hence Lemma 2 can be applied to k;
it asserts that there exist infinitely many fields Z whose conductors divide $\mathfrak{f}\infty$.
Conversely, if there exist infinitely many fields Z with these properties then k is
infinite. This is obvious because a finite field never possesses an infinity of
cyclic superfields of degree l whose conductors divide a fixed modulus $\mathfrak{f}\infty$.


THEOREM 2. If K is a class field of degree n over the infinite algebraic
number field k then there exists for every divisor f of n, where f is the order
of an element of the Galois group G(K, k), an infinity of prime ideals $\mathfrak{p}$ in k
which are prime to the discriminant of K over k and whose prime divisors $\mathfrak{P}$
in K have the relative residue class degree f.


Proof. We choose a finite subfield $k_0$ in k such that $N(k, k_0)$ is prime to n;
this is always possible because K was assumed to be a class field of degree n
over k. If K = k(θ) then there exists some finite field $k_i$ in k which contains
$k_0$ such that $K = K_i k$ where $K_i = k_i(\theta)$. According to the theorem on
arithmetic progressions in $k_i$ there exist infinitely many prime ideals $\mathfrak{p}(i)$ in $k_i$
which are prime to the discriminant of $K_i$ over $k_i$ and whose divisors $\mathfrak{P}(i, j(i))$
in $K_i$ are of relative residue degrees f over $k_i$, if f is the order of an arbitrary


element $S_i$ of the Galois group $G(K_i, k_i) = G(K, k)$. Now the Artin symbol
$(K_i, k_i/\mathfrak{p}(i)) = S_i$. Let $k_{i+\nu}$ be any finite extension of $k_i$ which belongs to
the approximation $\{k_i\}$ of k, and let $\mathfrak{p}(i+\nu)$ be any divisor of a fixed $\mathfrak{p}(i)$.
The translation theorem of class field theory asserts that $(K_{i+\nu}, k_{i+\nu}/\mathfrak{p}(i+\nu))$
is also of order f because the degree $[k_{i+\nu} : k_i]$ is prime to n. The residue class
degree of any divisor $\mathfrak{P}(i+\nu, j(i+\nu))$ is therefore equal to f for any ν.
Now take a sequence $\{\mathfrak{p}(i+\nu)\}$ of prime ideals $\mathfrak{p}(i+\nu)$ such that
$\mathfrak{p}(i) \subset \mathfrak{p}(i+1) \subset \cdots \subset \mathfrak{p}(i+\nu) \subset \cdots$. This sequence determines a prime
ideal $\mathfrak{p}$ in k and it is prime to the discriminant of K with respect to k. In the
same fashion we determine a chain of prime ideals $\mathfrak{P}(i+\nu, j(i+\nu))$ in $K_{i+\nu}$
such that


and ii) pity) 


The limit prime ideal $\mathfrak{P}$ of $\{\mathfrak{P}(i+\nu, j(i+\nu))\}$ in K is then a divisor of $\mathfrak{p}$,


and the residue class degree of $\mathfrak{P}$ is equal to f. The equalities


for the respective residue class degrees assert according to Herbrand the norm
relation.⁷

Now the number of prime ideals $\mathfrak{p}(i)$ belonging to a fixed order f is
infinite; hence the proof of the theorem is complete.

Now let $\mathfrak{p}$ be an arbitrary prime ideal of the infinite algebraic number
field k. Then $\mathfrak{p}$ uniquely determines a valuation on the field k. The system
of all fundamental sequences $\{\alpha_\nu\}$, $\alpha_\nu$ in k, with respect to that valuation
forms a field $k(\mathfrak{p})$, the so-called derived field of k with respect to $\mathfrak{p}$. If k is
equal to $\sum k_i$, then the intersections $\mathfrak{p}(i) = \mathfrak{p} \cap k_i$ are prime ideals in the finite
subfields $k_i$ of k. Form $\sum k_i(\mathfrak{p}(i))$. This field is in general an infinite
algebraic extension of $k_0(\mathfrak{p}(0))$ and it is not closed with respect to $\mathfrak{p}$; but


the following lemma holds.


LEMMA 3. The derived field $k(\mathfrak{p})$ of an infinite algebraic number field
$k = \sum k_i$ is equal to the derived field belonging to the field $\sum k_i(\mathfrak{p}(i))$ where
$k_i(\mathfrak{p}(i))$ denotes the perfect fields of $k_i$ with respect to $\mathfrak{p}(i) = \mathfrak{p} \cap k_i$.


Proof. According to the construction of the valuation belonging to $\mathfrak{p}$ in
k the value groups of $\mathfrak{p}$ in k and of $\mathfrak{p}' = \mathfrak{p} \cap \sum k_i(\mathfrak{p}(i))$ coincide.⁸


⁷ J. Herbrand, “Théorie arithmétique des corps de nombres de degré infini, I.
Extensions de degré fini,” Mathematische Annalen, vol. 106 (1932).

⁸ W. Krull, “Idealtheorie in unendlichen Zahlkörpern,” Mathematische Zeitschrift,
vol. 29 (1928).




First we show that $k(\mathfrak{p})$ is contained in the derived field of $\sum k_i(\mathfrak{p}(i))$.
Let $\{\alpha_\nu\}$ be an arbitrary fundamental sequence of elements $\alpha_\nu$ in k; it
represents an arbitrary element of $k(\mathfrak{p})$. Each of the elements $\alpha_\nu$ lies already in a
suitable finite subfield of k. Therefore the sequence $\{\alpha_\nu\}$ consists of elements
in $\sum k_i(\mathfrak{p}(i))$; it is also a fundamental sequence with respect to $\mathfrak{p}'$ because the
value groups of $\sum k_i(\mathfrak{p}(i))$ and k resp. $k(\mathfrak{p})$ coincide. Hence $\{\alpha_\nu\}$ lies in the
derived field of $\sum k_i(\mathfrak{p}(i))$.

Conversely, each fundamental sequence $\{\alpha'_\nu\}$ of $\sum k_i(\mathfrak{p}(i))$ is an element
of $k(\mathfrak{p})$. According to the definition of $\sum k_i(\mathfrak{p}(i))$ each element $\alpha'_\nu$ lies
already in a finite $\mathfrak{p}(\mu)$-adic field $k_\mu(\mathfrak{p}(\mu)) \supseteq k_\mu$; therefore we always can
choose some element $\alpha_\nu$ in k which lies arbitrarily near to $\alpha'_\nu$ in the sense
of the valuation belonging to $\mathfrak{p}'$. The sequence $\{\alpha_\nu\}$ is also a fundamental
sequence of $\sum k_i(\mathfrak{p}(i))$, and according to the definition of the closure of
$\sum k_i(\mathfrak{p}(i))$ we have $\{\alpha_\nu\} = \{\alpha'_\nu\}$. The sequence $\{\alpha_\nu\}$ is a fundamental
sequence of k; therefore $\{\alpha'_\nu\} = \{\alpha_\nu\}$ lies in $k(\mathfrak{p})$.

M. Moriya and the author have proved in two papers that the class field
theories over $\sum k_i(\mathfrak{p}(i))$ and $k(\mathfrak{p})$ are virtually the same. There exists a
one-to-one correspondence between the abelian finite extensions and finite
normal algebras over both fields respectively.⁹ Therefore it is of no
importance in which field arithmetic investigations are made. For convenience
we shall work in $\sum k_i(\mathfrak{p}(i))$ in the following considerations.


THEOREM 3. If Z is a cyclic class field of degree n over the infinite
algebraic number field k, then an algebra A = (a, Z/k) is a complete matric
algebra if and only if $A(\mathfrak{p}) = (a, Z/k) \times k(\mathfrak{p})$ are complete matric algebras
for all prime divisors $\mathfrak{p}$ of k.


Proof. According to what we just stated the relation $(a, Z/k) \times k(\mathfrak{p}) \sim k(\mathfrak{p})$
is equivalent to $(a, Z/k) \times \sum k_i(\mathfrak{p}(i)) \sim \sum k_i(\mathfrak{p}(i))$. Now let $k_0 \subset k$ be a
finite subfield such that $N(k, k_0)$ is prime to n; such a field $k_0$ always exists if
Z is a class field of degree n. Now Z = k(θ); let us take an extension $k_*$ of $k_0$
such that a and the coefficients belonging to the irreducible equation of θ in k
lie in $k_*$, and write $Z_* = k_*(\theta)$. Hence $(a, Z/k) \sim (a, Z_*/k_*) \times k$.

Now assume that $(a, Z/k) \nsim k$ although


$$(a, Z/k) \times \sum k_i(\mathfrak{p}(i)) \sim \sum k_i(\mathfrak{p}(i))$$
for all $\mathfrak{p}$ in k, the infinite prime spots included. Then also $(a, Z_*/k_*) \nsim k_*$.


⁹ M. Moriya and O. F. G. Schilling, “Zur Klassenkörpertheorie über unendlichen
perfekten Körpern,” and an additional note. Both to appear in the forthcoming volume
of the Sapporo Journal.




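According to the fundamental theorem on normal algebras over finite number
fields there exists at least one prime ideal $\mathfrak{p}_*$ in $k_*$ such that

$$(a, Z_*/k_*) \times k_*(\mathfrak{p}_*) \nsim k_*(\mathfrak{p}_*).$$

The field $k_*(\mathfrak{p}_*)$ is a subfield of $\sum k_i(\mathfrak{p}(i))$. Our assumptions yield

$$(a, Z_*/k_*) \times k_*(\mathfrak{p}_*) \times \sum k_i(\mathfrak{p}(i)) \sim (a, Z/k) \times \sum k_i(\mathfrak{p}(i)) \sim \sum k_i(\mathfrak{p}(i)),$$

that is to say that $\sum k_i(\mathfrak{p}(i))$ is a splitting field of $(a, Z_*/k_*) \times k_*(\mathfrak{p}_*)$. The
splitting must already be finished in a finite extension $k_\lambda(\mathfrak{p}(\lambda))$ of $k_*(\mathfrak{p}_*)$
because $\sum k_i(\mathfrak{p}(i))$ is algebraic over $k_*(\mathfrak{p}_*)$. Hence according to the local class
field theory over finite $\mathfrak{p}_*$-adic fields, the degree $[k_\lambda(\mathfrak{p}(\lambda)) : k_*(\mathfrak{p}_*)]$ must be a
multiple of the exponent belonging to the algebra $(a, Z_*/k_*) \times k_*(\mathfrak{p}_*)$. The
latter is a divisor of n and certainly different from one if

$$(a, Z_*/k_*) \times k_*(\mathfrak{p}_*) \nsim k_*(\mathfrak{p}_*).$$

Hence $([k_\lambda(\mathfrak{p}(\lambda)) : k_*(\mathfrak{p}_*)], n) \ne 1$ and a fortiori $([k_\lambda : k_*], n) \ne 1$. But
this contradicts the choice of $k_*$ in k. Therefore we must have

$$(a, Z_*/k_*) \times k_*(\mathfrak{p}_*) \sim k_*(\mathfrak{p}_*)$$

and hence $\mathfrak{p}_*$ cannot be a ramified prime ideal. This means that $(a, Z_*/k_*) \sim k_*$
and a fortiori that $(a, Z/k) \sim (a, Z_*/k_*) \times k \sim k$. As always the converse
is trivial.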


Remark. Theorem 3 holds of course for arbitrary simple algebras A over
the field k because they all possess cyclic representations.

Now let $k = \sum k_i$ be an arbitrary infinite algebraic number field. We
assume that there exists an algebra A of degree n over k which is not
isomorphic with a complete matric algebra over k. The algebra A then can be
represented in the form $A_0 \times k$ where $A_0$ is an algebra $A_*$ over a suitably
chosen finite subfield $k_*$ of k. Then we have also $A_* \nsim k_*$ and therefore there
exists at least one prime ideal $\mathfrak{p}(0) = \mathfrak{p}_*$ in the field $k_0 = k_*$, which we shall
take as field to start with the approximation $\{k_i\}$ of k, such that

$$A_0 \times k_0(\mathfrak{p}(0)) \nsim k_0(\mathfrak{p}(0)).$$

The exponent of $A_0 \times k_0(\mathfrak{p}(0))$ may be denoted by $m(\mathfrak{p}_*)$. By $\mathfrak{p}$ we denote
any prime divisor of $\mathfrak{p}_*$ in k; then $\mathfrak{p} = \lim \mathfrak{p}(i)$ where

$$\mathfrak{p}_* = \mathfrak{p}(0) \subset \mathfrak{p}(1) \subset \cdots$$


¹⁰ For the theory of normal algebras see H. Hasse, “Über die Struktur etc. ...,”
Mathematische Annalen, vol. 107 (1933).




SOME REMARKS ON CLASS FIELD THEORY. 421 
And the G-degree

$N(k, k_*; p) = [\sum k_i(p(i)) : k_0(p(0))]$

is equal to the least common multiple of all the degrees

$[k_i(p(i)) : k_{i-1}(p(i-1))].$

LEMMA 4. If $A = A_* \times k$ is a proper non-matric algebra of degree $n$
over the infinite field $k = \sum k_i$ then there exists at least one prime ideal $p$ in $k$
lying over a prime ideal $p_*$ of $k_* \subset k$, such that

$N(k, k_*; p) \not\equiv 0 \pmod{m(p_*)}.$

Proof. The relation $A = A_* \times k \not\sim k$ asserts that $k$ is not a splitting
field of $A_*$. Therefore no finite extension $k_i$ of $k_* = k_0$ is a splitting field of $A_*$.
Hence there exists according to Hasse's criterion on finite splitting fields at
least one prime ideal $p(0)$ in $k_0$ such that

$[k_i(p(i)) : k_0(p(0))] \not\equiv 0 \pmod{m(p(0))}$

if $p(i)$ is any prime divisor of $p(0) = p_*$ in $k_i$. If we now select a sequence

$p_* = p(0) \subset p(1) \subset \cdots \subset p(i) \subset \cdots$

then its limit prime ideal $p$ has the property that

$N(k, k_*; p) \not\equiv 0 \pmod{m(p_*)}.$

Obviously there exist in general many prime ideals $p$ lying over $p_*$ for which
this relation is fulfilled.
Now we are able to extend Theorem 3 to arbitrary fields $k$.
Now we are able to extend Theorem 3 to arbitrary fields k. 


THEOREM 4. Any normal algebra $A$ over $k$ of finite degree is a matric
algebra over $k$ if and only if $A \times k(p) \sim k(p)$ for all prime divisors $p$ of $k$.


Proof. Assume that $A \not\sim k$, in spite of $A \times k(p) \sim k(p)$ for all $p$.
According to Lemma 4 there would exist a finite subfield $k_*$ of $k$ and prime
ideals $p_*$ and $p$ such that $N(k, k_*; p) \not\equiv 0 \pmod{m(p_*)}$. This contradicts the
assumption $A \times k(p) \sim k(p)$, which is equivalent to

$A \times \sum k_i(p(i)) \sim \sum k_i(p(i)),$

for the latter asserts that

$[k_j(p(j)) : k_*(p_*)] \equiv 0 \pmod{m(p_*)}$

for suitable $j$.




Finally we wish to show by an example that there exist proper division
algebras $D$ of degree $n$ over certain infinite algebraic number fields $k$ although
the G-degree of $k$ is divisible by $n^\infty$.

Let k, be an arbitrary algebraic number field, and let n be an arbitrary 

positive integer. Suppose that p(0)” (v—=1,2,---,s) is any finite set of 
prime divisors in k containing at least one prime ideal and such that we can 
attribute to each p(0)” a rational fraction of maximal denominator n p(0)” mod 1 
for which > p(0)”=0 (mod 1), where one of them has exactly the denomi- 


nator n. Then there exists a uniquely determined division algebra D, over ky 
in which the p(0)” are ramified and which has exactly the exponent n. We 
now proceed to construct an infinite algebraic extension & of ky such that 
Do X k is still a division algebra and such that N(k,k,) =n. According 
to a theorem of Grunwald there exist infinitely many abelian fields k‘ of 
degree n over k such that the prime divisors p(0)” are totally decomposed in 
each of them.’ According to the criterion on splitting fields none of the fields 
k* is a splitting field of D. The fields 


define an infinite field & of G-degree n over ky. The field & is obviously not a 
splitting field of D, because no finite subfield of & is splitting field. For the 
same reason J remains a division algebra. We may mention that 


k(p”) = ko(p(0)”) 
for a prime ideal p” = lim p(i)’, p(i)”|p(0)” in ky. 


THE INSTITUTE FOR ADVANCED STUDY, 
PRINCETON, N. J. 


¹¹ W. Grunwald, “Ein allgemeines Existenztheorem für algebraische Zahlkörper,”
Journal für Mathematik, vol. 169 (1933).




ON THE ADDITION OF CONVEX CURVES. II.* 


By RicHARD KERSHNER. 


By the vectorial sum $C_1(+)C_2$ of two convex curves $C_1$ and $C_2$ is meant
the set of all points which may be represented in at least one way as the
vectorial sum of a point on $C_1$ and a point on $C_2$. It has been shown by Bohr¹
that $C_1(+)C_2$ is either the closed interior of a convex curve $C_O$ or is the closed
annular region between two convex curves $C_O$ and $C_I$, where $C_I$ lies wholly
within $C_O$. The outer boundary $C_O$ of $C_1(+)C_2$ was discussed by Haviland,²
who found very precise relationships between $C_O$ and the component curves
$C_1$, $C_2$ by the use of the Minkowski supporting function. For example, if
$C_1$ and $C_2$ each possess a continuous positive radius of curvature then so does
$C_O$ and, in fact, if $\rho_1(\theta)$, $\rho_2(\theta)$, $\rho_O(\theta)$ are the radii of curvature of $C_1$, $C_2$, $C_O$
respectively, at the point of $C_1$, $C_2$, $C_O$ where the oriented normal has the
inclination $\theta$ then $\rho_O(\theta) = \rho_1(\theta) + \rho_2(\theta)$.
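
The additivity of the radii of curvature on the outer boundary can be checked numerically. The following is a minimal sketch, not taken from the paper: it assumes the standard relation $\rho = h + h''$ between the radius of curvature and the supporting function $h(\theta)$, uses two ellipses as hypothetical test curves, and approximates $h''$ by a central difference. The function names are illustrative only.

```python
import math

def support_ellipse(a, b):
    """Supporting function h(theta) of the ellipse x^2/a^2 + y^2/b^2 = 1."""
    return lambda t: math.sqrt((a * math.cos(t)) ** 2 + (b * math.sin(t)) ** 2)

def radius_from_support(h, t, eps=1e-4):
    """Radius of curvature rho = h + h'', with h'' from a central difference."""
    second = (h(t + eps) - 2.0 * h(t) + h(t - eps)) / eps ** 2
    return h(t) + second

def ellipse_radius(a, b, t):
    """Closed form a^2 b^2 / h^3 for the ellipse, as a function of the normal angle."""
    return (a * b) ** 2 / support_ellipse(a, b)(t) ** 3

# Supporting functions add under vectorial addition, so the outer boundary
# of the sum has supporting function h1 + h2.
h1 = support_ellipse(3.0, 2.0)
h2 = support_ellipse(1.5, 1.0)
h_sum = lambda t: h1(t) + h2(t)

for t in (0.0, 0.7, 1.9, 4.0):
    outer = radius_from_support(h_sum, t)
    parts = ellipse_radius(3.0, 2.0, t) + ellipse_radius(1.5, 1.0, t)
    print(f"theta={t:.1f}  rho_O={outer:.6f}  rho_1+rho_2={parts:.6f}")
```

The two printed columns should agree up to the finite-difference error.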

Recently the author³ has investigated the inner boundary curve $C_I$ of
$C_1(+)C_2$. The results obtained are similar to those of Haviland but the
methods are essentially more complicated. The great distinction between the
treatment and the results in the two cases is illustrated by the fact that the
curve $C_I$ may possess corners while both $C_1$ and $C_2$ are analytic.⁴ This fact
contrasts remarkably with the above remarks concerning the radius of
curvature $\rho_O(\theta)$ of $C_O$.

The purpose of the present note is a discussion of the possible existence
of corners in the curve $C_I$. Specifically, it will be shown that if $C_1$ and $C_2$
are analytic curves, then $C_I$ can have but a finite number of corners. (It will
be shown that this number may be arbitrarily large.) On the other hand,
if it be only required of $C_1$ and $C_2$ that they possess radii of curvature which


* Received February 28, 1937. 

¹ H. Bohr, “Om Addition af uendelig mange konvekse Kurver,” Danske
Videnskabernes Selskab (Forhandlinger, 1913), pp. 325-366. For a short presentation of the
proof of this fact cf. B. Jessen and A. Wintner, “Distribution functions and the Riemann
zeta function,” Transactions of the American Mathematical Society, vol. 38 (1935), p. 69.

² E. K. Haviland, “On the addition of convex curves in Bohr's theory of Dirichlet
series,” American Journal of Mathematics, vol. 55 (1933), pp. 332-334.

³ R. Kershner, “On the addition of convex curves,” American Journal of
Mathematics, vol. 58 (1936), pp. 737-746.

⁴ Cf., e.g., R. Kershner, “On the values of the Riemann ζ-function on fixed lines
σ > 1,” American Journal of Mathematics, vol. 59 (1937), pp. 167-174.



can be differentiated infinitely often, then it is possible that $C_I$ have an infinite
number of corners. Finally it will be shown that if $C_1$ and $C_2$ have each a
continuous radius of curvature then the corners of $C_I$ are nowhere dense.

In the sequel it will always be assumed that $C_I$ exists. Then⁵ one of the
curves $C_1$, $C_2$ may be placed in the other, after a rotation through the angle $\pi$
about the origin, by a translation. It will be assumed that $C_1$ is the “larger”
of the two curves so that $C_2$ may be placed in $C_1$, in the manner indicated
above. By a point of $C_1$, $C_2$, $C_I$, in the direction $\theta$, or, briefly, a point $\theta$,
is meant a point where the oriented normal has the inclination $\theta$. Every point
of $C_I$, except a corner, has a direction in the cases to be considered. A corner
of $C_I$ will be said to have the direction $(\theta_1, \theta_2)$ if $\theta_1$ and $\theta_2$ are respectively the
lower and upper limits of the directions of points in the neighborhood of the
corner.

Using these notations we prove 


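LEMMA I. If $C_I$ has a corner in the direction $(\theta_1, \theta_2)$ and if $C_1$ and $C_2$
have continuous, positive radii of curvature $\rho_1(\theta)$ and $\rho_2(\theta)$ respectively,
then $\rho_1(\theta) \le \rho_2(\theta + \pi)$ for some $\theta$ in $\theta_1 < \theta < \theta_2$; and, on the other hand,
if $\rho_1(\theta) \le \rho_2(\theta + \pi)$ for some interval $\eta_1 < \theta < \eta_2$ then $C_I$ has a corner in
the direction $(\theta_1, \theta_2)$ where $\theta_1 \le \eta_1 < \eta_2 \le \theta_2$.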


Proof. In terms of the mechanical interpretation⁶ of $C_I$ it is clear that
the existence of a corner of $C_I$ in the direction $(\theta_1, \theta_2)$ means that the curve $C_2$,
after being rotated through an angle $\pi$ about the origin, may be placed within
$C_1$ in such a way that it has internal contact with $C_1$ at the two points of $C_1$
which have the directions $\theta_1$, $\theta_2$. Lemma I follows⁷ immediately from this
fact. Lemma I gives immediately


THEOREM I. If $C_1$ and $C_2$ are analytic curves then $C_I$ has at most a finite
number of corners.

For suppose $C_I$ had an infinite number of corners. Then the function
$\rho_1(\theta) - \rho_2(\theta + \pi)$, where $\rho_i(\theta)$ is the radius of curvature of $C_i$, would have
zeros clustering at some particular point $\theta_0$. But since $\rho_i(\theta)$ is regular analytic
this would imply $\rho_1(\theta) \equiv \rho_2(\theta + \pi)$ and $C_I$ would not exist.

Now if $\rho(\theta)$ is a positive, continuous, periodic function of $\theta$, with period


⁵ R. Kershner, loc. cit. 3, Theorem I, p. 738.

⁶ R. Kershner, loc. cit. 3, p. 741.

⁷ Cf., e.g., S. Mukhopadhyaya, “Circles incident on an oval of undefined curvature,”
Tôhoku Mathematical Journal, vol. 34 (1931), pp. 115-129.


$2\pi$, then there will exist a closed convex curve of which $\rho(\theta)$ is the radius of
curvature, if and only if the closure conditions

(1) $\int_0^{2\pi} \rho(\theta) \cos\theta \, d\theta = 0; \qquad \int_0^{2\pi} \rho(\theta) \sin\theta \, d\theta = 0$

are satisfied.⁸ Using this fact it is very easy to show that the number of
corners of $C_I$ may be arbitrarily large even when both $C_1$ and $C_2$ are analytic.
For let $C_2$ be a circle of radius $r$. Let

(2) $\rho_1^{(n)}(\theta) = r + 1 + \cos n\theta - \delta_n \qquad (n = 2, 3, \cdots)$

where $\delta_n > 0$ is chosen so small that, first of all, $\rho_1^{(n)}(\theta)$ is everywhere positive,
so that ((1) being obviously satisfied) there does exist a closed convex curve
$C_1^{(n)}$ having $\rho_1^{(n)}(\theta)$ as radius of curvature; and secondly that $C_2$ may be
placed entirely within $C_1^{(n)}$. In order to satisfy the first requirement on $\delta_n$
it is obviously enough to choose $\delta_n < r$. To see that the second requirement
may be satisfied it is enough to notice that if $\delta_n = 0$ then the corresponding
$C_1^{(n)}$ has a radius of curvature never less and sometimes greater than $r$ so that
$C_2$ can be placed entirely within $C_1^{(n)}$ by a known theorem.⁹ Now let $\delta_n$ be
fixed satisfying the above requirements and consider the corresponding $C_1^{(n)}$.
There are clearly $n$ disjoint $\theta$-intervals in which the radius of curvature (2)
of $C_1^{(n)}$ is less than $r$. Then, by Lemma I, the inner boundary $C_I^{(n)}$ of the
vectorial sum $(C_1^{(n)})(+)C_2$ will have a corner in the directions $(\theta_j, \theta'_j)$ for a
set of direction intervals including these $n$ disjoint $\theta$-intervals. In general it is
not true that two disjoint intervals in which $\rho_1^{(n)}(\theta) \le \rho_2(\theta + \pi)$ correspond
to distinct corners but in this case the symmetry of $C_1^{(n)}$ makes it obvious that
the $n$ intervals mentioned above actually correspond to $n$ distinct corners.
Thus, the inner boundary $C_I$ of the analytic convex sum $C_1^{(n)}(+)C_2$, where
$C_1^{(n)}$ is defined by its radius of curvature (2) and $C_2$ is a circle of radius $r$,
has $n$ corners.
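
The two facts used in this example, that (2) satisfies the closure conditions (1) and that the radius of curvature drops below $r$ on exactly $n$ disjoint intervals, can be illustrated numerically. The sketch below is only an illustration under assumed values ($r = 1$, $\delta_n = 0.1$, a midpoint rule with a fixed number of sample points); the function names are hypothetical and nothing here replaces the argument in the text.

```python
import math

def rho1(theta, n, r, delta):
    """Radius of curvature (2): r + 1 + cos(n*theta) - delta."""
    return r + 1.0 + math.cos(n * theta) - delta

def closure_integrals(n, r, delta, steps=20000):
    """Midpoint-rule approximations of the two integrals in (1)."""
    dt = 2.0 * math.pi / steps
    c = s = 0.0
    for i in range(steps):
        t = (i + 0.5) * dt
        value = rho1(t, n, r, delta)
        c += value * math.cos(t) * dt
        s += value * math.sin(t) * dt
    return c, s

def count_low_intervals(n, r, delta, steps=20000):
    """Count maximal theta-intervals (mod 2*pi) on which rho1 < r."""
    dt = 2.0 * math.pi / steps
    low = [rho1((i + 0.5) * dt, n, r, delta) < r for i in range(steps)]
    # An interval starts wherever a sample is low and its cyclic predecessor is not.
    return sum(1 for i in range(steps) if low[i] and not low[i - 1])

r, delta = 1.0, 0.1          # assumed values with 0 < delta < r
for n in (2, 3, 5, 8):
    c, s = closure_integrals(n, r, delta)
    print(n, round(c, 9), round(s, 9), count_low_intervals(n, r, delta))
```

For each $n$ the two integrals should vanish up to rounding and the interval count should equal $n$.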


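THEOREM II. There exist convex curves $C_1$, $C_2$ whose radii of curvature
$\rho_1(\theta)$, $\rho_2(\theta)$ have infinitely many derivatives and which are such that the
vector sum $C_1(+)C_2$ has an inner boundary $C_I$ with infinitely many corners.


Proof. Let $C_2$ be again a circle of radius $r$ so that $\rho_2(\theta) \equiv r$. Let
$\theta_0, \theta_1, \theta_2, \cdots$ be an infinite sequence of $\theta$-values such that

(3) $\pi/2 = \theta_0 > \theta_1 > \theta_2 > \cdots; \qquad \theta_n \to 0.$

Let

(4) $\rho_1(\theta) = r$ if $\theta_{2n} \ge \theta \ge \theta_{2n+1}; \qquad (n = 0, 1, \cdots)$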


⁸ W. Blaschke, Kreis und Kugel, Leipzig (1916), pp. 115-116.
⁹ Cf., e.g., W. Blaschke, loc. cit., p. 116.




and

(5) $\rho_1(\theta) = r + h_n(\theta)$ if $\theta_{2n+1} > \theta > \theta_{2n+2}; \qquad (n = 0, 1, \cdots)$

where $h_n(\theta) > 0$ is a function¹⁰ (defined only on the interval $\theta_{2n+1} > \theta > \theta_{2n+2}$)
for which all derivatives (and the function values) exist, and approach zero as
$\theta \to \theta_{2n+1} - 0$ or $\theta \to \theta_{2n+2} + 0$, and such that the first $n$ derivatives are less
than $(\theta_{2n+2})^n$ in absolute value. Thus $\rho_1(\theta)$ is defined by (3), (4), (5) for
the interval $0 < \theta \le \pi/2$. Let the definition of $\rho_1(\theta)$ be completed by the
requirement

(6) $\rho_1(\theta)$ is periodic of period $\pi/2$.


Now $\rho_1(\theta)$ is differentiable infinitely often. This is obvious in the interval
$0 < \theta \le \pi/2$. Thus, by (6), it is sufficient to show that all derivatives exist
at the point $\theta = 0$. But, by (4) for $n = 0$, the left-hand derivatives at $\theta = \pi/2$
and hence, by (6), at $\theta = 0$ are all zero. On the other hand, by (3), (4), (5)
and the definition of $h_n(\theta)$, the right-hand derivatives are all zero also. For
suppose it has been proved that the $k$-th derivative at $\theta = 0$ exists and is zero,
then the $(k+1)$-th difference quotients are bounded by a quantity tending to 0.

Now, by (6), the closure conditions (1) are obviously satisfied so that
there exists a convex curve $C_1$ of which $\rho_1(\theta)$ is the radius of curvature. The
circle $C_2$ of radius $r$ can be placed entirely within $C_1$ since $\rho_1(\theta) \ge r$. Thus
the vectorial sum $C_1(+)C_2$ does have an inner boundary curve $C_I$. But if the
“mechanical” interpretation of vectorial addition mentioned above be
remembered, it is clear that each of the intervals involved in (4) will be directions
of distinct corners of this inner boundary $C_I$. This completes the proof of
Theorem II.

It is noticed that the example was so constructed that the points of $C_1$ and
$C_2$ which corresponded to a cluster point of corners of $C_I$ were points where the
radii of curvature of the two curves were equal. This fact could not be avoided.
In fact, it is a direct consequence of Lemma I that if the two curves $C_1$ and $C_2$
have continuous radii of curvature $\rho_1(\theta)$ and $\rho_2(\theta)$ and if the inner boundary
curve of $C_1(+)C_2$ exists and has infinitely many corners clustering in the
direction $\theta_0$ then $\rho_1(\theta_0) = \rho_2(\theta_0 + \pi)$. Thus


THEOREM III. If $C_1$ and $C_2$ have continuous radii of curvature and if
the vectorial sum $C_1(+)C_2$ has an inner boundary curve $C_I$ then the corners
(if any) of $C_I$ are nowhere dense on the inner curve.


THE JOHNS HOPKINS UNIVERSITY. 


¹⁰ Such a function may be taken in the form

$h_n(\theta) = k_n \exp[(\theta_{2n+1} - \theta)^{-1} (\theta_{2n+2} - \theta)^{-1}]$

where the constant $k_n > 0$ is chosen sufficiently small.




REAL CANONICAL BINARY TRILINEAR FORMS.* 


By Rurus OLDENBURGER. 


1. Introduction. In 1922, E. Schwartz¹ found all of the canonical
binary trilinear forms for the class of all non-singular linear transformations
in the complex field, and distinguished them by means of algebraic invariants.
In 1932, the author found these canonical forms independently, and classified
them more briefly according to arithmetic invariants.² In the present paper
the author obtains all of the canonical binary trilinear forms for the class of
all non-singular linear transformations in the field of reals, and a complete
invariant system. The number of canonical forms is finite and is one more
than the number of such forms for the complex field. The method of
treatment depends on the use of arithmetic invariants.

Explicitly, the problem solved in this paper is the following: Given two
sets of real constants $g_{rst}$, $a_{rst}$, $r, s, t = 1, 2$, find the conditions on $g_{rst}$, $a_{rst}$
such that there exist real solutions $p_r^{\rho}$, $q_s^{\sigma}$, $m_t^{\tau}$ of the equations

$g_{rst} = a_{\rho\sigma\tau} \, p_r^{\rho} q_s^{\sigma} m_t^{\tau} \qquad (r, s, t, \rho, \sigma, \tau = 1, 2),$

for which the determinants $|p_r^{\rho}|$, $|q_s^{\sigma}|$, $|m_t^{\tau}|$ are not zero.


2. Definitions. In another paper the author³ defined and made a
thorough study of ranks of n-way matrices and associated forms. A few of
these definitions are given here. The rank $r_i$ of a 3-way matrix $A = (a_{ijk})$,
$i, j, k = 1, 2$, and its associated trilinear form $F = a_{ijk} x_i y_j z_k$, $i, j, k = 1, 2$,
is the rank of the 2-way matrix

$\begin{pmatrix} a_{111} & a_{112} & a_{121} & a_{122} \\ a_{211} & a_{212} & a_{221} & a_{222} \end{pmatrix}.$

The ranks $r_j$, $r_k$ are defined similarly. Assume that $a_{ijk}$ $(i, j, k = 1, 2)$ are not


* Received June 16, 1936; revised November 11, 1936. 

¹ E. Schwartz, “Über binäre trilineare Formen,” Mathematische Zeitschrift, vol. 12
(1922), pp. 18-35.

² R. Oldenburger, “On canonical binary trilinear forms,” Bulletin of the American
Mathematical Society, vol. 38 (1932), pp. 385-387. In this paper a complete
bibliography of earlier papers on binary trilinear forms is given.

³ R. Oldenburger, “Composition and rank of n-way matrices and multilinear forms,”
Annals of Mathematics, vol. 35 (1934), pp. 622-657.



all zero. The 3-way rank $r[jk, i]$ of $A$ and $F$ is defined to be 2 or 1 according
as the quantities⁴

(1) $a_{111}a_{122} - a_{112}a_{121}, \qquad a_{111}a_{222} + a_{211}a_{122} - a_{112}a_{221} - a_{212}a_{121}, \qquad a_{211}a_{222} - a_{212}a_{221}$

are not all zero or are all zero. The ranks $r[ij, k]$, $r[ik, j]$ are defined
similarly. Evidently $r[jk, i] = r[kj, i]$. These ranks are invariant under
non-singular linear transformations on $F$.

In this paper two binary trilinear forms $F = a_{ijk} x_i y_j z_k$, $G = g_{pqr} x'_p y'_q z'_r$
and their associated matrices will be said to be equivalent if there exist
transformations

$x_i = a_{ip} x'_p, \qquad y_j = b_{jq} y'_q, \qquad z_k = c_{kr} z'_r,$

where the square matrices $(a_{ip})$, $(b_{jq})$, $(c_{kr})$ of the second order are
non-singular, and these transformations bring $F$ into $G$. Similarly if $F$, $G$ are
bilinear forms.


3. The canonical forms for which $r_i = r_j = r_k = 2$. By a theorem⁵ of
another paper if one of the ranks $r[ij, k]$, $r[jk, i]$, $r[ik, j]$ of $A$ is 1, at least
one of the ranks $r_i$, $r_j$, $r_k$ of $A$ is 1. Hence

$r[ij, k] = r[jk, i] = r[ik, j] = 2.$

Let $A_1 = (a_{1jk})$, $A_2 = (a_{2jk})$, $j, k = 1, 2$. Since the coefficients of $\rho^2$, $\rho\sigma$,
$\sigma^2$ in the determinant $|\rho A_1 + \sigma A_2|$ are the quantities (1), it follows that

$|\rho A_1 + \sigma A_2| \not\equiv 0.$

If $A_2$ is non-singular, while $A_1$ is singular, make the transformation $x_1 = x'_2$,
$x_2 = x'_1$ on the form $F$. If $A_1$, $A_2$ are both singular and the matrix
$(\rho A_1 + \sigma A_2)$ is non-singular, then $\rho\sigma \ne 0$. Let the bilinear forms
$a_{1jk} y_j z_k$, $a_{2jk} y_j z_k$, $j, k = 1, 2$, be denoted by $F_1$, $F_2$ respectively. Let

$F' = x'_1 F'_1 + x'_2 F'_2$

denote a form for which

$F'_1 = \rho F_1 + \sigma F_2, \qquad F'_2 = F_2,$

where $\rho$, $\sigma$ are chosen so that $|\rho A_1 + \sigma A_2| \ne 0$. Equating $F'$ to $F = x_1 F_1 + x_2 F_2$
we obtain


⁴ These are 3-way determinants of the second order.
⁵ R. Oldenburger, Annals of Mathematics, vol. 35 (1934), p. 649.




(2) $x_1 = \rho x'_1, \qquad x_2 = \sigma x'_1 + x'_2.$

The non-singular transformation (2), therefore, reduces $F$ to a form $F'$ for
which $(a'_{1jk})$ is non-singular. Since in every case $F$ is equivalent to a form
$F' = x'_1 F'_1 + x'_2 F'_2$, where $F'_1$ is non-singular, it is no restriction to assume
in what follows that $F_1$ is non-singular.

The pair of bilinear forms $F_1$, $F_2$ is now equivalent in the field of reals
to the canonical pair⁶

(3) $F_1 = y_1 z_1 + y_2 z_2, \qquad F_2 = y_1 z_2 + a y_2 z_1 + b y_2 z_2.$

It is to be noted that any pair of binary bilinear forms is rationally equivalent,
in the non-singular case, to the pair (3) or to the pair

(4) $y_1 z_1 + y_2 z_2, \qquad c y_1 z_1 + c y_2 z_2.$

Now $F_1$, $F_2$ are not equivalent to (4), since, then, the form $F = x_1 F_1 + x_2 F_2$
has $r_i = 1$.

In what follows in this section, we shall assume that $F = x_1 F_1 + x_2 F_2$,
where $F_1$, $F_2$ are as given in (3). We shall consider three cases.


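Case 1. $b^2 + 4a > 0$. In the field of reals, the determinant
$D = |\rho A_1 + \sigma A_2|$ factors into distinct linear factors $(\alpha\rho + \beta\sigma)$, $(\gamma\rho + \delta\sigma)$.
Let

(5) $\rho' = \alpha\rho + \beta\sigma, \qquad \sigma' = \gamma\rho + \delta\sigma.$

Then $D = \rho'\sigma'$. By another paper of the author⁷ the transformation (5)
corresponds to a non-singular linear transformation on the $x$'s of $F$, giving a
new form $F' = a'_{ijk} x'_i y_j z_k$ for which

(6) $|\rho' a'_{1jk} + \sigma' a'_{2jk}| = \rho'\sigma'.$

Since the coefficients of $\rho'^2$ and $\sigma'^2$ vanish in (6), the 2-way matrices
$(a'_{1jk})$, $(a'_{2jk})$ are singular. The form $F'_1 = a'_{1jk} y_j z_k$ is evidently equivalent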


to a form 


7 4 
Simultaneously FP.’ = a’;xyj;2, transforms into a form 


⁶ L. E. Dickson, Modern Algebraic Theories, pp. 89-97.
⁷ R. Oldenburger, Transactions of the American Mathematical Society, vol. 39
(1936), pp. 432-433.




Hence F” is equivalent to 


Since F,” is singular, we can write 
— (ays”” + + 8"). 


If $\beta, \delta \ne 0$, making the non-singular transformations


Yo = ay, + Bye’, 


on $F''$, we obtain the canonical form

$R = x_1 y_1 z_1 + x_2 y_2 z_2,$

where, for simplicity, the primes on the variables have been removed.
If $\beta = 0$, we can write

$F'' = y_1 B,$

where $B$ is a bilinear form in the $x$'s and $z$'s; whence the rank $r_j$ of $F''$ is 1.
Similarly if $\delta = 0$,

$F'' = z_1 Q,$

where $Q$ is bilinear in the $x$'s and $y$'s and $r_k$ of $F''$ is 1. In either case we
obtain a contradiction of the assumption $r_j = r_k = 2$.


Case 2. $b^2 + 4a = 0$. In this case, $|\rho A_1 + \sigma A_2|$ is a perfect square.
The form $F = x_1 F_1 + x_2 F_2$ defined by (3) is now

$F = x_1 (y_1 z_1 + y_2 z_2) + x_2 (y_1 z_2 - g^2 y_2 z_1 + 2g y_2 z_2),$

where $g = b/2$. Assume that $b \ne 0$. Making the non-singular transformations


on F, we obtain 
L = 121 + + 


where we have dropped the primes in L. 
If b=0, then a=0. Interchanging and 2. in F, we obtain L. 


Case 3. $b^2 + 4a < 0$. In this case, $|\rho A_1 + \sigma A_2|$ does not factor in the
field of reals. Let


= 9 Le = 2's, 
n= — (9. + 
= #2)/9, Zp == -— | 




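$M = x'_1 F'_1 + x'_2 F'_2,$

where

$F'_1 = y_1 z_1 + y_2 z_2, \qquad F'_2 = y_1 z_2 - y_2 z_1.$

We shall prove that $F = x_1 F_1 + x_2 F_2$, defined by (3), is equivalent to $M$.
Apply to $M$ the non-singular linear transformation

(8) $x'_1 = \rho x_1 + \tau x_2, \qquad x'_2 = \sigma x_1 + \xi x_2.$

This gives the new form

$M' = x_1 (\rho F'_1 + \sigma F'_2) + x_2 (\tau F'_1 + \xi F'_2).$

For $F$ to be equivalent to $M$ in the field of reals, it is evidently necessary and
sufficient that there exist real quantities $\rho, \sigma, \tau, \xi$ such that, if we write

$\Delta = \begin{vmatrix} \rho & \sigma \\ \tau & \xi \end{vmatrix},$

then

(9) $\Delta \ne 0,$

and there exist real non-singular transformations on the $y$'s and $z$'s so that $M'$
becomes $F$. Then there must exist real values of $\rho, \sigma, \tau, \xi$ satisfying (9) such
that the pair of bilinear forms $\rho F'_1 + \sigma F'_2$, $\tau F'_1 + \xi F'_2$ is equivalent under
non-singular transformations on the $y$'s and $z$'s in the field of reals to $F_1$, $F_2$.
This equivalence is satisfied if and only if these pairs of forms have the same
invariant factors.⁸ Let

$A_1 = A'_1 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \qquad A_2 = \begin{pmatrix} 0 & 1 \\ a & b \end{pmatrix}, \qquad A'_2 = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}.$

The characteristic matrix of $F_1$, $F_2$ is $\lambda A_1 + \mu A_2$,
which has the unique invariant factor

(10) $|\lambda A_1 + \mu A_2| = \lambda^2 + b\lambda\mu - a\mu^2.$

The characteristic determinant of $\rho F'_1 + \sigma F'_2$, $\tau F'_1 + \xi F'_2$ is

(11) $|\lambda(\rho A'_1 + \sigma A'_2) + \mu(\tau A'_1 + \xi A'_2)| = \lambda^2 Q(\rho, \sigma) + 2\lambda\mu B(\rho, \sigma, \tau, \xi) + \mu^2 Q(\tau, \xi),$

where $Q(\rho, \sigma) = \rho^2 + \sigma^2$, $B(\rho, \sigma, \tau, \xi) = (\rho\tau + \sigma\xi)$.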


⁸ L. E. Dickson, Modern Algebraic Theories, p. 115.




The invariant factor (10) is equal to the corresponding invariant factor of
$\rho F'_1 + \sigma F'_2$, $\tau F'_1 + \xi F'_2$ if and only if the coefficients of (11) are proportional
to those of (10). Then there must exist real values of $\rho, \sigma, \tau, \xi$ satisfying
(9), and a real $k \ne 0$ such that

(12) $Q(\rho, \sigma) = k, \qquad Q(\tau, \xi) = -ka, \qquad B(\rho, \sigma, \tau, \xi) = kb/2.$

Since $b^2 + 4a < 0$, we have $a < 0$. It follows, since $Q(\rho, \sigma)$ is a positive
definite quadratic form, that $Q$ represents $k$, $-ka$ where $k > 0$. If
$\rho_1, \sigma_1, \tau_1, \xi_1$ are a set of real values satisfying (12) for a given real $k \ne 0$, the
real quantities

$\rho_1/\sqrt{k}, \qquad \sigma_1/\sqrt{k}, \qquad \tau_1/\sqrt{k}, \qquad \xi_1/\sqrt{k}$

satisfy (12), where in (12) we set $k = 1$. We therefore restrict our study of
solutions of (12) to the case $k = 1$. Solving $(12_1)$, $(12_2)$ with $k = 1$ we obtain

(13) $\rho = \pm\sqrt{1 - \sigma^2}, \qquad \tau = \pm\sqrt{-a - \xi^2}.$

Substituting the solutions (13) in $(12_3)$ with $k = 1$, we obtain

(14) $\pm\sqrt{(1 - \sigma^2)(-a - \xi^2)} = (b - 2\sigma\xi)/2.$

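Set

(15) $\xi = k_1 \sqrt{-a}.$

Since $a < 0$, $\xi$ is real if $k_1$ is real. Assume henceforth that $k_1$ is real.
Substituting (15) in (14), we obtain the following solution

(16) $\sigma = \dfrac{b k_1 \pm \sqrt{(b^2 + 4a)(k_1^2 - 1)}}{2\sqrt{-a}}$

and from (13) the solution

(17) $\tau = \pm\sqrt{a(k_1^2 - 1)}.$

Since $b^2 + 4a$, $a < 0$, the quantities $\sigma$, $\tau$ are real if and only if

(18) $k_1^2 \le 1.$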


Substituting for p, €,r from (13,), (15), (17) in A as given above (9), 
we find that A= 0 if 


Substituting (16) in the relation 


om+ ky, 




transposing terms, squaring, and simplifying, we obtain 


, — (b? + 4a) 
(19) ) 
Since the right member of (19) is not zero, if there exist solutions of (19), 
the left member is also 0 and, for a given value of the + sign, (19) is of 
the form 


= B, %, BAD, 


which has at most two real solutions for /;. 

By $(13_1)$, $\rho$ is real if and only if $\sigma^2 \le 1$; whence, by (16), assuming that
$k_1^2 \le 1$, so that $\sigma$ is real,

(20) $-1 \le \dfrac{b k_1 \pm \sqrt{(b^2 + 4a)(k_1^2 - 1)}}{2\sqrt{-a}} \le 1.$

Taking the value of the $\pm$ sign in (20) to be $+$, the right inequality of (20)
can be reduced to

(21) $\sqrt{(b^2 + 4a)(k_1^2 - 1)} \le 2\sqrt{-a} - b k_1.$

If $b > 0$, the right member of (21) is $\ge 0$ for

(22) $k_1 \le 2\sqrt{-a}/b,$

and, if $b < 0$, that member is $\ge 0$ for

(23) $k_1 \ge 2\sqrt{-a}/b.$

If the right member of (21) is $\ge 0$, and $k_1^2 \le 1$, we can square both sides of
(21). Simplifying the resulting inequality, we obtain

$(b - 2\sqrt{-a}\, k_1)^2 \ge 0,$

which is satisfied for every real value of $k_1$.

If $b = 0$, $\sigma = \pm\sqrt{1 - k_1^2}$, and $\rho = \pm k_1$, whence $\rho$ is real for every
real value of $k_1$.

Evidently, there is an unlimited number of real values of $k_1$ satisfying
(18), (22) or (23), and not satisfying (19). Also, for any solutions of $\sigma$, $\xi$
from (15), (16), the $\pm$ signs in $\rho$, $\tau$ can always be chosen so that (14) is
satisfied. We have now proved that in every case we can choose $k_1$ so that




$\rho, \sigma, \tau, \xi$ are real, $\Delta \ne 0$, and (12) is satisfied. Hence $F$ is equivalent to $M$
for all $a$, $b$ such that $b^2 + 4a < 0$.
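
The construction in Case 3 can be traced numerically. The following minimal sketch follows formulas (13), (15), (16), (17) as they are read here, with $k = 1$: for given $a$, $b$ with $b^2 + 4a < 0$ and a chosen $k_1$ with $k_1^2 \le 1$, it searches the sign choices for real $\rho, \sigma, \tau, \xi$ that satisfy (12) and $\Delta = \rho\xi - \sigma\tau \ne 0$. The function name and the tolerance are assumptions of this illustration.

```python
import math
from itertools import product

def case3_parameters(a, b, k1, tol=1e-9):
    """Return (rho, sigma, tau, xi) satisfying (12) with k = 1, or None."""
    if b * b + 4 * a >= 0 or k1 * k1 > 1:
        return None
    s = math.sqrt(-a)
    xi = k1 * s                                          # formula (15)
    root = math.sqrt((b * b + 4 * a) * (k1 * k1 - 1))
    tau_abs = math.sqrt(a * (k1 * k1 - 1))               # formula (17)
    for sign in (1.0, -1.0):
        sigma = (b * k1 + sign * root) / (2 * s)         # formula (16)
        if sigma * sigma > 1:
            continue                                     # rho would not be real
        rho_abs = math.sqrt(1 - sigma * sigma)           # formula (13)
        for s_rho, s_tau in product((1.0, -1.0), repeat=2):
            rho, tau = s_rho * rho_abs, s_tau * tau_abs
            mixed_ok = abs(rho * tau + sigma * xi - b / 2) < tol   # third relation of (12)
            delta = rho * xi - sigma * tau                          # determinant in (9)
            if mixed_ok and abs(delta) > tol:
                return rho, sigma, tau, xi
    return None

for a, b, k1 in [(-1.0, 0.0, 0.5), (-2.0, 1.0, 0.3)]:
    print((a, b, k1), case3_parameters(a, b, k1))
```

A `None` result only means that the chosen $k_1$ is unsuitable (for instance, it may violate (20)); the text shows that suitable values of $k_1$ always exist.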


4. The canonical forms for which $r_i = 1$. Assume that $r_i = 1$,
$r_j = r_k = 2$. We can reduce the form $F = a_{ijk} x_i y_j z_k$, $i, j, k = 1, 2$, at once
to $x_1 B$ where $B$ is bilinear in $y$ and $z$ and of rank 2. Reducing $B$ to canonical
form we obtain

$H = x_1 y_1 z_1 + x_1 y_2 z_2.$

Assume that $r_i = r_j = 1$. $F$ can be reduced at once to

$K = x_1 y_1 z_1.$

No form with $r_i = r_j = 1$, $r_k = 2$ exists. We have therefore treated all
cases.


5. Fundamental theorems of equivalence. We have proved


THEOREM 1. Two binary trilinear forms $F = a_{ijk} x_i y_j z_k$ and $G = g_{pqr} x'_p y'_q z'_r$
are equivalent in the field of reals, if and only if they have the same ranks
$r_i$, $r_j$, and $r_k$, and, if $r_i = r_j = r_k = 2$, the determinants $|\rho a_{1jk} + \sigma a_{2jk}|$ and
$|\rho g_{1qr} + \sigma g_{2qr}|$ have both

(a) distinct real linear factors,
or

(b) coincident real linear factors,
or

(c) no real linear factors.


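THEOREM 2. In the field of reals, a binary trilinear form $F = a_{ijk} x_i y_j z_k$
is equivalent to one of the following canonical forms:

(a) $R = x_1 y_1 z_1 + x_2 y_2 z_2$, if $r_i = r_j = r_k = 2$, and part (a) of Theorem 1
is satisfied;

(b) $L$, the three-term form obtained in Case 2 of Section 3, if $r_i = r_j = r_k = 2$, and part (b) of
Theorem 1 is satisfied;

(c) $M = x_1 y_1 z_1 + x_1 y_2 z_2 + x_2 y_1 z_2 - x_2 y_2 z_1$, if $r_i = r_j = r_k = 2$, and part
(c) of Theorem 1 is satisfied;

(d) $H = x_1 y_1 z_1 + x_1 y_2 z_2$, if $r_i = 1$, $r_j = r_k = 2$;

(e) $K = x_1 y_1 z_1$, if $r_i = r_j = r_k = 1$.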
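
The invariants used in Theorems 1 and 2 are easy to compute for a given coefficient array. The sketch below is an illustration, not part of the paper: it stores $a_{ijk}$ as a nested list with indices 0 and 1, computes the three ranks as ranks of the $2 \times 4$ flattenings, and, when all three are 2, reads off the case (a), (b) or (c) from the discriminant of the binary quadratic $|\rho A_1 + \sigma A_2|$. The helper names and the tolerance are hypothetical.

```python
from itertools import combinations

def rank_2xn(rows, tol=1e-12):
    """Rank (0, 1 or 2) of a matrix with two rows."""
    r0, r1 = rows
    if all(abs(v) <= tol for v in r0 + r1):
        return 0
    for p, q in combinations(range(len(r0)), 2):
        if abs(r0[p] * r1[q] - r0[q] * r1[p]) > tol:
            return 2
    return 1

def flattening_ranks(a):
    """Ranks r_i, r_j, r_k of the three 2x4 flattenings of a[i][j][k]."""
    idx = (0, 1)
    ri = rank_2xn([[a[i][j][k] for j in idx for k in idx] for i in idx])
    rj = rank_2xn([[a[i][j][k] for i in idx for k in idx] for j in idx])
    rk = rank_2xn([[a[i][j][k] for i in idx for j in idx] for k in idx])
    return ri, rj, rk

def pencil_discriminant(a):
    """Discriminant of det(rho*A1 + sigma*A2) as a quadratic in rho, sigma."""
    A1, A2 = a[0], a[1]
    p = A1[0][0] * A1[1][1] - A1[0][1] * A1[1][0]            # coefficient of rho^2
    r = A2[0][0] * A2[1][1] - A2[0][1] * A2[1][0]            # coefficient of sigma^2
    q = (A1[0][0] * A2[1][1] + A2[0][0] * A1[1][1]
         - A1[0][1] * A2[1][0] - A2[0][1] * A1[1][0])        # coefficient of rho*sigma
    return q * q - 4 * p * r

def classify(a, tol=1e-12):
    """Return the ranks and, if they are (2, 2, 2), the case label of Theorem 1."""
    ranks = flattening_ranks(a)
    if ranks != (2, 2, 2):
        return ranks, None
    d = pencil_discriminant(a)
    if d > tol:
        return ranks, "a"      # distinct real linear factors
    if d < -tol:
        return ranks, "c"      # no real linear factors
    return ranks, "b"          # coincident real linear factors

def form(terms):
    """Build a[i][j][k] from {(i, j, k): coefficient} with 1-based indices."""
    a = [[[0.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]]]
    for (i, j, k), c in terms.items():
        a[i - 1][j - 1][k - 1] = c
    return a

R = form({(1, 1, 1): 1, (2, 2, 2): 1})
M = form({(1, 1, 1): 1, (1, 2, 2): 1, (2, 1, 2): 1, (2, 2, 1): -1})
H = form({(1, 1, 1): 1, (1, 2, 2): 1})
print(classify(R), classify(M), classify(H))
```

Applied to $R$, $M$ and $H$ this gives the labels "a", "c" and no label respectively, in agreement with the list above.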


6. Note concerning $M$. In the theory of forms, an arithmetic invariant
called “factorization rank” plays an important rôle. The factorization ranks




of $R$, $L$, $H$, $K$ have been studied elsewhere⁹ by the author. The factorization
rank of $M$ is 3, since the matrix $(m_{ijk})$ of $M$ can be written in the form

(24) $(m_{ijk}) = \left( \sum_{\alpha=1}^{3} a_{\alpha i} b_{\alpha j} c_{\alpha k} \right),$
where 
0 01 
(dai) ={ 1 1}, (ba;) 1 07, (Cax) = 10 > 
0 1 11 


and not in the form (24), where the range of $\alpha$ is 1, 2.
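
A decomposition of the shape (24) can be verified mechanically. The sketch below does not attempt to reproduce the author's matrices; it uses one possible three-term decomposition, $M = (x_1 + x_2)(y_1 - y_2) z_1 + x_2 y_1 (z_2 - z_1) + x_1 y_2 (z_1 + z_2)$, chosen here for illustration, and checks entry by entry that it reproduces the coefficients of $M$.

```python
from itertools import product

# Coefficients m_ijk of M = x1y1z1 + x1y2z2 + x2y1z2 - x2y2z1 (0-based indices).
M_COEFFS = {(0, 0, 0): 1, (0, 1, 1): 1, (1, 0, 1): 1, (1, 1, 0): -1}

# Rows are indexed by alpha = 1, 2, 3; these factor matrices are an assumption
# of this example, not taken from the paper.
A = [(1, 1), (0, 1), (1, 0)]      # x1 + x2,  x2,       x1
B = [(1, -1), (1, 0), (0, 1)]     # y1 - y2,  y1,       y2
C = [(1, 0), (-1, 1), (1, 1)]     # z1,       z2 - z1,  z1 + z2

def from_factors(A, B, C):
    """Entries sum_alpha a_{alpha i} b_{alpha j} c_{alpha k} of the form (24)."""
    return {
        (i, j, k): sum(A[al][i] * B[al][j] * C[al][k] for al in range(len(A)))
        for i, j, k in product((0, 1), repeat=3)
    }

def matches(coeffs, entries):
    """True if the entries agree with the sparse coefficient table."""
    return all(entries[key] == coeffs.get(key, 0) for key in entries)

print(matches(M_COEFFS, from_factors(A, B, C)))
```

This confirms only that three terms suffice; that two terms do not suffice over the reals is the statement made in the text.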


7. Reductions. The transformations reducing any trilinear form to
canonical form for the field of reals can be written down at once from the 
theory of this paper and known theory of bilinear forms. 


ARMOUR INSTITUTE OF TECHNOLOGY. 


⁹ R. Oldenburger, “On arithmetic invariants of binary cubic and binary trilinear
forms,” Bulletin of the American Mathematical Society, vol. 42 (1936), pp. 871-873.




A REMARK ON A THEOREM OF ARZELA.* 


By HaRTMAN. 


Let $J$ denote a bounded interval, and $\{f_n(x)\}$ a sequence of functions
defined on $J$ such that (i) $\{f_n(x)\}$ is uniformly bounded on $J$ and (ii) every
$f_n(x)$ is continuous on $J$. Condition (i) implies that for every enumerable
subset $C$ of $J$ there exists a subsequence of $\{f_n(x)\}$ which is convergent at every
point of $C$. Since $C$ may be chosen dense on $J$, it follows from a standard
theorem of Arzelà that if (i) is satisfied and (ii) is replaced by the more
stringent condition that $\{f_n(x)\}$ be equicontinuous on $J$, then $\{f_n(x)\}$ contains
a subsequence which is uniformly convergent on $J$. The question now arises
whether or not (i) and (ii) alone imply the existence of a subsequence which
is convergent on $J$. This question will be answered in the negative by proving
a sharper statement to the effect that there exist sequences $\{f_n(x)\}$ which
satisfy (i) and (ii) but are such that every subsequence of $\{f_n(x)\}$ is divergent
almost everywhere. For instance, every subsequence of the sequence

$\sin 2\pi x, \quad \sin 4\pi x, \quad \cdots, \quad \sin 2n\pi x, \quad \cdots$

will be shown to be divergent almost everywhere.

Let $\{k_n\}$ be any increasing sequence of positive integers. A theorem of
Hardy and Littlewood (Acta Mathematica, vol. 37 (1914), p. 181) states that
there exists a set $S = S(\{k_n\})$ of measure 1 in $[0, 1]$ such that if $\theta$ is a point
of $S$, then the sequence of numbers $\{(k_n\theta)\}$, where $(k_n\theta)$ denotes the fractional
part of $k_n\theta$, is dense in $[0, 1]$. It follows that the sequence $\{\sin 2\pi k_n x\}$ is
divergent at each point of $S$; for if $\theta$ is a point of $S$, the sequence of numbers
$\{\sin 2\pi k_n\theta\}$ is dense in $[-1, 1]$.

Obviously, the above remarks also apply if $\sin x$ is replaced by any other
non-constant, continuous, periodic function $f(x)$ and $f_n(x)$ is defined to
be $f(nx)$.
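
The behaviour described above can be observed numerically, although a computation at a single point proves nothing about almost every point. The sketch below is an illustration under assumed choices: the point $x = \sqrt{2} - 1$ and the subsequence $k_n = 1, 8, 15, \cdots$ are hypothetical, and the spread of the last hundred values of $\sin 2\pi k_n x$ is used as a crude indicator that the subsequence is not settling down to a limit.

```python
import math

def subsequence_values(x, ks):
    """Values sin(2*pi*k*x) along the index sequence ks."""
    return [math.sin(2.0 * math.pi * k * x) for k in ks]

def spread(values):
    """Difference between the largest and smallest value."""
    return max(values) - min(values)

x = math.sqrt(2.0) - 1.0            # assumed sample point in [0, 1]
ks = list(range(1, 2001, 7))        # an increasing sequence of positive integers
values = subsequence_values(x, ks)
tail = values[-100:]
print("spread of the last 100 values:", round(spread(tail), 3))
```

A convergent sequence would have a tail whose spread tends to zero; here the tail keeps sweeping across most of $[-1, 1]$.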


THE JOHNS HOPKINS UNIVERSITY. 


* Received January 6, 1937. 


